SETS OF INTEGERS WITH NO LARGE SUM-FREE SUBSET 



SEAN EBERHARD, BEN GREEN, AND FREDDIE MANNERS 

Abstract. Answering a question of P. Erdős from 1965, we show that for
every $\varepsilon > 0$ there is a set $A$ of $n$ integers with the following property: every
set $A' \subset A$ with at least $(\frac13 + \varepsilon) n$ elements contains three distinct elements
$x, y, z$ with $x + y = z$.

1. Introduction

An old argument of Erdős [Erd65] shows that every set $A$ of $n$ nonzero integers
contains a subset $A' \subset A$ of size $|A'| \geq \frac13 n$ which is sum-free, meaning $x + y = z$
has no solutions with $x, y, z \in A'$. The argument is simple: for $\theta \in \mathbb{R}/\mathbb{Z}$ the set
$A_\theta$ of $x \in A$ such that $\frac13 < \{x\theta\} < \frac23$ is clearly sum-free, and if $\theta$ is chosen uniformly
at random then the expected size of $A_\theta$ is $\frac13 n$, so $|A_\theta| \geq \frac13 n$ for some $\theta$.
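
The middle-third argument can be tried out numerically. The sketch below is only an illustration and not part of the paper: instead of a uniformly random $\theta$ it searches the rational values $\theta = j/p$, where $p$ is a prime congruent to 2 modulo 3 exceeding every $|x|$ (a classical discretisation, chosen here so that the arithmetic is exact). The helper names `middle_third` and `erdos_subset` are hypothetical.

```python
from fractions import Fraction


def is_sum_free(s):
    """True if no x, y, z in s (x = y allowed) satisfy x + y = z."""
    s = set(s)
    return not any((x + y) in s for x in s for y in s)


def middle_third(a_set, theta):
    """The set A_theta of x with 1/3 < {x * theta} < 2/3 (exact arithmetic)."""
    lo, hi = Fraction(1, 3), Fraction(2, 3)
    return {x for x in a_set if lo < (x * theta) % 1 < hi}


def is_prime(p):
    return p >= 2 and all(p % d for d in range(2, int(p ** 0.5) + 1))


def erdos_subset(a_set):
    """Search theta = j/p and return the largest middle-third subset found."""
    a_set = set(a_set)
    # Assumption of this sketch: p is a prime = 2 (mod 3) larger than all |x|.
    p = max(abs(x) for x in a_set) + 1
    while not (p % 3 == 2 and is_prime(p)):
        p += 1
    best = set()
    for j in range(1, p):
        candidate = middle_third(a_set, Fraction(j, p))
        if len(candidate) > len(best):
            best = candidate
    return best


A = {2, 3, 4, 5, 6, 8, 10}
best = erdos_subset(A)
```

Every candidate produced this way is sum-free for the same reason as in the continuous argument, and averaging over $j$ shows that some $j$ yields at least a third of the elements.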

Let $f(n)$ be the largest $k$ such that every set of $n$ nonzero integers contains
a sum-free subset of size $k$. Erdős's lower bound $f(n) \geq \frac13 n$ has not been much
improved. As pointed out by Alon and Kleitman [AK90], Erdős's argument can be
modified to show $f(n) \geq \frac13(n + 1)$: if $\theta \approx 0$ then $A_\theta$ is empty, so for some $\theta$ we
must actually have $|A_\theta| > \frac13 n$, so $|A_\theta| \geq \frac13(n + 1)$. The best known lower bound is
due to Bourgain [Bou97], who showed $f(n) \geq \frac13(n + 2)$ for $n \geq 3$ using an elaborate
Fourier-analytic technique. In particular it is unknown whether $f(n) \geq \frac13 n + \omega(n)$
for some $\omega(n) \to \infty$, though this seems likely.

In the opposite direction, considering the largest element of a subset $A \subset
\{1, \dots, n\}$ gives an obvious upper bound of $f(n) \leq \frac12(n + 1)$. Improvements to
this upper bound have all implicitly used the following device. Suppose that $A$ is
a set of size $m$ with no sum-free subset of size larger than $f(m)$ and that $B$ is a
set of size $n$ with no sum-free subset of size larger than $f(n)$. Then if $M \in \mathbb{N}$ is
sufficiently large $A \cup MB$ is a set of size $m + n$ with no sum-free subset of size
larger than $f(m) + f(n)$, so $f(m + n) \leq f(m) + f(n)$. This condition is well known
to imply that $f(n)/n$ converges to $\inf f(n)/n$, so to show $f(n) \leq cn + o(n)$ it
suffices to find a single set $A$ with no sum-free subset of size larger than $c|A|$. Let
$\sigma = \lim f(n)/n = \inf f(n)/n$.
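
The device $A \cup MB$ can be checked by brute force on toy sets. The sketch below is illustrative only: the sets `A` and `B` and the dilation `M = 100` are hypothetical choices (they are not extremal sets for $f$), and `max_sum_free_size` is an exhaustive search suitable only for very small inputs.

```python
from itertools import combinations


def is_sum_free(s):
    """True if no x, y, z in s (x = y allowed) satisfy x + y = z."""
    s = set(s)
    return not any((x + y) in s for x in s for y in s)


def max_sum_free_size(a_set):
    """Size of the largest sum-free subset, by exhaustive search."""
    elems = sorted(a_set)
    for k in range(len(elems), 0, -1):
        if any(is_sum_free(c) for c in combinations(elems, k)):
            return k
    return 0


A = {1, 2, 3}
B = {1, 2, 3, 4}
M = 100  # hypothetical dilation, large enough that A and M*B are disjoint
union = A | {M * b for b in B}

size_a = max_sum_free_size(A)
size_b = max_sum_free_size(B)
size_union = max_sum_free_size(union)
```

Any sum-free subset of the union restricts to sum-free subsets of $A$ and of $MB$, which is why its size cannot exceed the sum of the two individual maxima.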

In [Erd65] Erdős mentioned that Hinton proved $\sigma \leq \frac{7}{15} \approx 0.467$. He also pointed
out, attributing the construction to Klarner, that the set $A = \{2,3,4,5,6,8,10\}$
shows $\sigma \leq \frac37 \approx 0.429$. Using a set of size 29 Alon and Kleitman [AK90] showed
$\sigma \leq \frac{12}{29} \approx 0.414$. Malouf [Mal94] in her thesis (as well as Füredi, according
to Guy [Guy04]) used $A = \{1,2,3,4,5,6,8,9,10,18\}$ to show $\sigma \leq \frac25 = 0.4$.
Lewko [Lew10] used a set of size 28 to show $\sigma \leq \frac{11}{28} \approx 0.393$. Incidentally, in a 1992
letter [Erd] to Klarner, Erdős claims this same bound of $\frac{11}{28}$, but he includes no
proof.
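
Bounds of this kind come from a finite check on a single set, which for small sets can be done exhaustively. The following sketch (not from the paper; the function names are hypothetical) verifies the bound attributed to Klarner by searching all subsets of $\{2,3,4,5,6,8,10\}$.

```python
from itertools import combinations


def is_sum_free(s):
    """True if no x, y, z in s (x = y allowed) satisfy x + y = z."""
    s = set(s)
    return not any((x + y) in s for x in s for y in s)


def max_sum_free_size(a_set):
    """Size of the largest sum-free subset, by exhaustive search."""
    elems = sorted(a_set)
    for k in range(len(elems), 0, -1):
        if any(is_sum_free(c) for c in combinations(elems, k)):
            return k
    return 0


klarner = {2, 3, 4, 5, 6, 8, 10}
largest = max_sum_free_size(klarner)
ratio = largest / len(klarner)  # upper bound on sigma given by this set
```

The search is exponential in the size of the set, so it is practical for the 7-element and 10-element examples but not as a general method.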

Recently, Alon [Alo13] showed that for each $n$ there exists $m$ such that $f(m)/m <
f(n)/n$. Thus there is no $n$ such that $f(n)/n = \sigma$. Applying this to Lewko's bound,
Alon showed for instance that $\sigma \leq \frac{11}{28} - \varepsilon$ for some $\varepsilon \approx 10^{-50000}$.

The question of whether $\sigma = \frac13$ has been mentioned several times [AS08, CL07,
Erd65, Erd73, Guy04, Kol96]. Our purpose in this paper is to answer this question
affirmatively.

Theorem 1.1. There is a set of $n$ positive integers with no sum-free subset of size
greater than $\frac13 n + o(n)$.

By the above argument it suffices to find, for each $\varepsilon > 0$, a single set $A$ with no
sum-free subset of size larger than $(\frac13 + \varepsilon)|A|$. In fact we find a set $A$ such that
every subset $A'$ of size larger than $(\frac13 + \varepsilon)|A|$ contains a solution to $x + y = z$ with
$x \neq y$. This answers a further question asked in [Erd65].

One of the ingredients of our argument is a rough structure theorem for sets
$A$ satisfying conditions of the form $|A - A| < 4|A|$, which may be of independent
interest. Specifically, if $A$ is a set of integers with $|A - A| \leq (4 - \varepsilon)|A|$ then $A$ has
density at least $\frac12 + c\varepsilon$ on some arithmetic progression of length $\gg_\varepsilon |A|$.

2. Overview of the proof

The proof of Theorem 1.1 breaks down naturally into several parts. We outline
these informally here, and give an indication of how they combine.

Note that there are certain local obstructions to a set $A$ having the desired
property, that is to say having no sum-free subset $A'$ with $|A'| \geq (\frac13 + \varepsilon)|A|$. For
instance, not more than $(\frac13 + \varepsilon)|A|$ of the elements of $A$ can be odd, as these form
a sum-free set in $A$. We think of this as an obstruction coming from $\mathbb{Z}/2\mathbb{Z}$. Not
more than $(\frac13 + \varepsilon)|A|$ of the elements of $A$ can be congruent to 2 or 3 (mod 5), an
obstruction coming from $\mathbb{Z}/5\mathbb{Z}$. Similarly, not more than $(\frac13 + \varepsilon)|A|$ elements can
be contained in an interval $[x, 2x)$, an obstruction coming from $\mathbb{R}$.
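
Each of the three obstructions is a sum-free set, which is easy to confirm directly. The sketch below is a small sanity check rather than anything from the paper; the range `N = 200` and the endpoint `x = 37` are arbitrary illustrative choices.

```python
def is_sum_free(s):
    """True if no x, y, z in s (x = y allowed) satisfy x + y = z."""
    s = set(s)
    return not any((x + y) in s for x in s for y in s)


N = 200  # arbitrary range for the illustration
odds = {n for n in range(1, N + 1) if n % 2 == 1}             # from Z/2Z
two_three_mod_5 = {n for n in range(1, N + 1) if n % 5 in (2, 3)}  # from Z/5Z
x = 37  # arbitrary left endpoint
interval = set(range(x, 2 * x))                                # integers in [x, 2x)

checks = [is_sum_free(odds), is_sum_free(two_three_mod_5), is_sum_free(interval)]
```

In each case the reason is visible modulo 2, modulo 5, or from the size of the elements: a sum of two members of the set lands outside the set.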

In fact we shall see in Section 3 that, in some sense, these restrictions coming
from $\mathbb{Z}/Q\mathbb{Z}$ for various $Q$ and from $\mathbb{R}$ are the only obstructions to $A$ having the
desired property.

To deal with these restrictions modulo $Q$ and in $\mathbb{R}$ we consider a weight function
$w : \mathbb{Z}/Q\mathbb{Z} \times [0,1] \to (0, \infty)$. Roughly speaking, we will define a set $A \subset \{1, \dots, N\}$
in such a way that a proportion $w(x,y)$ of the elements of $A$ lie near the value
$yN$ and are congruent to $x$ (mod $Q$). The "local" version of our problem is then
roughly the following.

Problem 2.1 (Local problem). Find $w$ such that if $A \subset \mathbb{Z}/Q\mathbb{Z} \times [0,1]$ is open
and if $\int_A w(x,y)\,d\mu \geq \frac13 + \varepsilon$ then $A$ contains a summing triple $x, y, x + y$. Here, $\mu$
denotes the uniform probability measure on $\mathbb{Z}/Q\mathbb{Z} \times [0,1]$.

In Section 3 we show that if $w$ satisfies a slightly stronger version of Problem 2.1
(specifically Proposition 3.1), then a set $A$ may be constructed from $w$ as suggested
above and Theorem 1.1 holds for this $A$. The actual construction of $A$, which
involves a random selection argument, occurs at (3.3). A crucial tool in showing
that $A$ has the required property (and elsewhere in the paper) is the arithmetic
regularity lemma due to the second-named author and Tao [GT10]. The statement
of this is recalled in Lemma A.2.

The remainder of the paper is concerned with constructing the weight function $w$,
that is to say with solving the local problem. This construction is rather involved.
At its heart is an iterative argument (Lemma 5.2) allowing us to take a near-solution
$w$, such that if $\int_A w(x,y)\,d\mu \geq \alpha$ then $A$ contains many summing triples,
and improve it to a nearer-solution $w'$, with corresponding parameter $\alpha' < \alpha$. The
sequence $\alpha, \alpha', \alpha'', \dots$ obtained in this way converges rapidly to $\frac13 + \varepsilon$.

The main driver for this iterative argument is a structural result concerning
sum-free (or almost sum-free) subsets of $\mathbb{Z}/Q\mathbb{Z} \times [0,1]$ with (uniform) measure just a little
more than $\frac13$. The crucial result here is Corollary 5.1, which states that such sets
"avoid zero", i.e., have very little mass on $H \times I$, where $H \leq \mathbb{Z}/Q\mathbb{Z}$ is a subgroup
of small index and $I \subset [0,1]$ is a (not too small) open interval containing 0. Thus
if $w$ is chosen to have a lot of its mass concentrated on $H \times I$, then $\int_A w(x,y)\,d\mu$ is
small whenever $\mu(A)$ is a bit more than $\frac13$. The iteration then works in some sense
by applying the same arguments to $A \cap (H \times I)$. In particular, the weight $w$ we
construct blows up near zero.

The proof of Corollary 5.1 rests on the rather lengthy arguments of Section 4,
which concern the structure of open sets $A \subset \mathbb{Z}/Q\mathbb{Z} \times [0,1]$ with $\mu(A - A) \leq 4\mu(A) - \varepsilon$,
where $\mu$ denotes the uniform measure (and generalisations of this statement).
The key result here is Corollary 4.2. This in turn is deduced from Theorem 4.1,
which concerns sets of integers $A \subset \{1, \dots, N\}$ satisfying the same condition, that
$|A - A| \leq 4|A| - \varepsilon N$. The conclusion is that they have density at least $\frac12 + c\varepsilon$ on a
progression of length $\gg_\varepsilon N$, a result which may be of independent interest. (This
theme is elaborated upon briefly in Section 6, which is independent of the rest of
the paper.) The proof of Theorem 4.1 uses the arithmetic regularity lemma again,
as well as an application of the Brunn-Minkowski inequality for open subsets of $\mathbb{R}^2$.

On account of our double application of the arithmetic regularity lemma, the
$o(n)$ term in Theorem 1.1 is more or less ineffective. The authors believe that the
main obstacle to a more effective $o(n)$ here is the use of the arithmetic regularity
lemma in Section 4, which for all we know could be replaced by more elementary
arguments.

Notation. We will introduce various pieces of notation as we go along. Throughout
the paper we will also use the following at least somewhat standard notations.

The expression $O_{A_1,\dots,A_k}(1)$ denotes a constant which may depend on $A_1, \dots, A_k$,
and $O_{A_1,\dots,A_k}(Y) = O_{A_1,\dots,A_k}(1)Y$. If we write $X \ll_{A_1,\dots,A_k} Y$ then we mean
that $X \leq O_{A_1,\dots,A_k}(Y)$. The expression $o_{A_1,\dots,A_k;N\to\infty}(1)$ denotes an expression
which tends to zero as $N \to \infty$, the rate at which this happens being possibly
dependent on the parameters $A_1, \dots, A_k$. On account of its relative ugliness we
will use this notation sparingly.

If $f : \{1, \dots, N\} \to \mathbb{C}$ is a function then we write

$$\|f\|_{\ell^p(N)} = \left( \mathbb{E}_{n \leq N} |f(n)|^p \right)^{1/p}.$$

We will use this only when $p = 1$ or 2. The normalisation, which is perhaps
nonstandard, ensures that $\|f\|_{\ell^1(N)} \leq \|f\|_{\ell^2(N)} \leq \|f\|_\infty$.
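
As a quick numerical illustration of this normalisation, the sketch below computes averaged norms for a short list of values. It assumes the norm is the $p$-th root of the average of $|f(n)|^p$, as the stated chain of inequalities suggests; the function name `lp_norm` and the sample values are hypothetical.

```python
def lp_norm(f, p):
    """Normalised l^p(N) norm: (average over n of |f(n)|^p)^(1/p)."""
    return (sum(abs(v) ** p for v in f) / len(f)) ** (1.0 / p)


f = [3, -1, 0, 2, -2]          # sample values f(1), ..., f(5)
l1 = lp_norm(f, 1)
l2 = lp_norm(f, 2)
linf = max(abs(v) for v in f)  # the sup norm
```

With the unnormalised sums in place of averages the first inequality would go the other way, which is why the normalisation matters here.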

3. The main argument



In this section we prove Theorem 1.1 assuming the existence of a weight function
$w : \mathbb{Z}/Q\mathbb{Z} \times [0,1] \to (0, \infty)$, the role of which was briefly outlined in the preceding
section. Proposition 3.1 below, whose proof will occupy Sections 4 and 5, specifies
the properties we shall require of $w$. Before stating this proposition we introduce
some pieces of nomenclature.

We will view both $\mathbb{Z}/q\mathbb{Z} \times [0,1]$ and $\mathbb{Z}/q\mathbb{Z} \times [0,1] \times (\mathbb{R}/\mathbb{Z})^d$, for various integers $q$
and $d$, as metric spaces. On each of these spaces $X$ we place an "obvious" metric,
namely a suitable product metric of the discrete metric on $\mathbb{Z}/q\mathbb{Z}$ and the Euclidean
metrics on $[0,1]$ and on $(\mathbb{R}/\mathbb{Z})^d$. This allows us to talk about Lipschitz functions
on $X$: if $F : X \to \mathbb{C}$ then we define $\|F\|_{\mathrm{Lip}}$ to be the infimum of all constants $K$
such that $|F(x) - F(x')| \leq K d(x, x')$ for all $x, x' \in X$.

We will also put natural measures on these spaces $X$, which we will always
denote by $\mu$ (more precise notation such as $\mu_{\mathbb{Z}/q\mathbb{Z} \times [0,1]}$ would be rather ugly and
unnecessary). The measure $\mu$ will always be the product of the uniform probability
measures on each factor, namely the uniform measure on $\mathbb{Z}/q\mathbb{Z}$ (which assigns mass
$1/q$ to each point), and normalised Lebesgue measure on $[0,1]$ and the torus $(\mathbb{R}/\mathbb{Z})^d$.

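Finally, if $X$ is one of the sets above and if $f : X \to \mathbb{C}$ is a function then we
define

$$T(f) = \int \int f(x) f(y) f(x + y) \, d\mu(x) \, d\mu(y).$$ (3.1)

We also write $T(A) = T(1_A)$ if $A \subset X$. We use the same notation when $X =
\{1, \dots, N\}$ with the uniform probability measure, so if $f : \{1, \dots, N\} \to \mathbb{C}$ is a
function we write

$$T(f) = \mathbb{E}_{x, y \leq N} f(x) f(y) f(x + y).$$ (3.2)
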

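By a weight function we simply mean a function $w : \mathbb{Z}/Q\mathbb{Z} \times [0,1] \to (0, \infty)$ such
that $\int w \, d\mu = 1$. If $\mathbb{T} = (\mathbb{R}/\mathbb{Z})^d$ and $Q \mid q$ then by $w \times 1_{\mathbb{T}} : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times \mathbb{T} \to (0, \infty)$
we mean the function given by $(w \times 1_{\mathbb{T}})(x, y, z) = w(x \ (\mathrm{mod}\ Q), y)$.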

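Proposition 3.1. Let $\varepsilon > 0$. Then there is an integer $Q$ and a Lipschitz weight
function $w : \mathbb{Z}/Q\mathbb{Z} \times [0,1] \to (0, \infty)$ with the following property. If $\mathbb{T} = (\mathbb{R}/\mathbb{Z})^d$
and $Q \mid q$, then for any continuous function $\Psi : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times \mathbb{T} \to [0,1]$ such that
$\int \Psi \cdot (w \times 1_{\mathbb{T}}) \, d\mu \geq \frac13 + \varepsilon$ we have $T(\Psi) \gg_\varepsilon 1$.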

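Note that because $Q$ and $w$ depend only on $\varepsilon$, there are constants $c_1(\varepsilon)$ and $c_1'(\varepsilon)$
such that $0 < c_1(\varepsilon) \leq w(x) \leq c_1'(\varepsilon)$ for every $x \in \mathbb{Z}/Q\mathbb{Z} \times [0,1]$, and there is a
constant $L(\varepsilon)$ such that $\|w\|_{\mathrm{Lip}} \leq L(\varepsilon)$. Choose $c_2(\varepsilon)$ to be the implied constant in
Proposition 3.1, so that the conclusion of the proposition is that $T(\Psi) \geq c_2(\varepsilon)$.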




The next lemma is quite standard. In it we encounter the notion of the Gowers
$U^2$-norm $\|f\|_{U^2(N)}$, whose definition and relevant basic properties are recalled in
Appendix A.

Lemma 3.2. Suppose that $p : \{1, \dots, N\} \to [0,1]$ is a function. Then there is a
set $A \subset \{1, \dots, N\}$ such that $\|1_A - p\|_{U^2(N)} \ll N^{-1/4}$.

Proof. Choose $A$ at random by including $n$ in $A$ with probability $p(n)$, these choices
being independent for different $n$. We claim that $A$ has the required property on
average. Write $X_n = 1_A(n) - p(n)$. Then the random variables $X_n$ are independent,
bounded by 1, and of mean zero. We have

$$\|1_A - p\|_{U^2(N)}^4 \ll \frac{1}{N^3} \sum_{n_1 + n_2 = n_3 + n_4} X_{n_1} X_{n_2} X_{n_3} X_{n_4}.$$

The expected value of any term on the right hand side with $n_1, n_2, n_3, n_4$ distinct
is 0. This accounts for all except $O(N^2)$ terms, and so $\mathbb{E}\|1_A - p\|_{U^2(N)}^4 \ll 1/N$.
The result follows immediately. □
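
The random selection in this proof is easy to simulate. The sketch below is illustrative only: it evaluates the normalised fourth-moment sum appearing on the right-hand side above, not the Gowers norm itself (whose definition is in the appendix), and the parameters `N = 200`, the constant probability 0.5, and the seed are arbitrary assumptions.

```python
import random


def fourth_moment(X):
    """(1/N^3) * sum over n1 + n2 = n3 + n4 of X[n1] X[n2] X[n3] X[n4]."""
    N = len(X)
    # conv[s] = sum over pairs (n1, n2) with n1 + n2 = s of X[n1] * X[n2]
    conv = [0.0] * (2 * N - 1)
    for i, xi in enumerate(X):
        for j, xj in enumerate(X):
            conv[i + j] += xi * xj
    # Summing conv[s]^2 over s counts every quadruple with n1 + n2 = n3 + n4.
    return sum(c * c for c in conv) / N ** 3


rng = random.Random(0)   # fixed seed so the run is reproducible
N = 200
p = [0.5] * N            # hypothetical inclusion probabilities p(n)
indicator = [1.0 if rng.random() < p[n] else 0.0 for n in range(N)]
X = [indicator[n] - p[n] for n in range(N)]

value = fourth_moment(X)
```

For independent mean-zero $X_n$ only the $O(N^2)$ non-distinct index patterns contribute in expectation, so the computed value should be of order $1/N$, far below what a non-random choice such as $X_n = \frac12$ for all $n$ would give.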

Fix $\varepsilon > 0$, and let $Q$ and $w$ be as in Proposition 3.1. By Lemma 3.2 there is a
set $A$ such that

$$1_A(n) = \frac{1}{\|w\|_\infty} w(n \ (\mathrm{mod}\ Q), n/N) + g_{\mathrm{unf}}(n),$$ (3.3)

where $\|g_{\mathrm{unf}}\|_{U^2(N)} = o(1)$. Since $\int w = 1$ and $w$ has Lipschitz constant $O_\varepsilon(1)$, it
follows from Lemmas A.6 and A.8 that

$$\frac{|A|}{N} = \frac{1}{\|w\|_\infty} + o_{\varepsilon; N \to \infty}(1).$$ (3.4)

We shall show that this set $A$ satisfies Theorem 1.1.



Theorem 3.3. Let $N \geq N_0(\varepsilon)$ be sufficiently large, let $A$ be the set just constructed
and suppose $A' \subset A$ has no solutions to $x + y = z$. Then $|A'| \leq (\frac13 + 2\varepsilon)|A|$.

Proof. Let $A' \subset A$ be a subset of $A$ such that $|A'| > (\frac13 + 2\varepsilon)|A|$. We apply the
arithmetic regularity lemma [GT10] to $1_{A'}$. The statement of this lemma is recalled
in Lemma A.2. Let

$$\delta = \min\left( \frac{c_2(\varepsilon) c_1(\varepsilon)^3}{20 c_1'(\varepsilon)^3}, \frac{\varepsilon}{4 c_1'(\varepsilon)}, \frac{1}{100} \right),$$ (3.5)

and let $\mathcal{F} : \mathbb{N} \to \mathbb{R}_+$ be a growth function, depending on $\varepsilon$, to be specified later.
Applying the regularity lemma with parameter $\frac{1}{16}\delta^4$ and growth function $\mathcal{F}$ we
obtain an integer $M \ll_{\varepsilon, \mathcal{F}} 1$ and a decomposition

$$1_{A'} = f_{\mathrm{tor}} + f_{\mathrm{sml}} + f_{\mathrm{unf}},$$ (3.6)
where $\|f_{\mathrm{sml}}\|_{\ell^2(N)} \leq \delta^4/16$, $\|f_{\mathrm{unf}}\|_{U^2(N)} \leq 1/\mathcal{F}(M)$ and

$$f_{\mathrm{tor}}(n) = F(n \ (\mathrm{mod}\ q), n/N, \theta n)$$ (3.7)

for some $M$-Lipschitz $F : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times (\mathbb{R}/\mathbb{Z})^d \to [0,1]$ and some
$(\mathcal{F}(M), N)$-irrational $\theta \in (\mathbb{R}/\mathbb{Z})^d$, where $q, d \leq M$. For the rest of this section, write $\mathbb{T} =
(\mathbb{R}/\mathbb{Z})^d$.

We may regard the mod $q$ dependence of $f_{\mathrm{tor}}$ as mod $qQ$ dependence instead
without affecting the Lipschitz constant of $F$. Relabelling, we may assume that
$Q \mid q$ and $q \ll_\varepsilon M$.

The property that $A'$ is a subset of $A$ manifests as an approximate upper bound
for $F$ in terms of the weight $w$. As the next lemma shows, by absorbing the error
into $f_{\mathrm{sml}}$ we can assume that $F \leq \|w\|_\infty^{-1} (w \times 1_{\mathbb{T}})$ pointwise.

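Lemma 3.4. Suppose $\mathcal{F}$ grows sufficiently rapidly depending on $\varepsilon$ and $N \geq N_0(\varepsilon, \mathcal{F})$
is sufficiently large. Then we can modify the decomposition (3.6) to $1_{A'} = f'_{\mathrm{tor}} +
f'_{\mathrm{sml}} + f_{\mathrm{unf}}$ where $\|f'_{\mathrm{sml}}\|_{\ell^2(N)} \leq \delta^2$ and $f'_{\mathrm{tor}}(n) = F'(n \ (\mathrm{mod}\ q), n/N, \theta n)$ for some
$O_\varepsilon(M)$-Lipschitz function $F' : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times (\mathbb{R}/\mathbb{Z})^d \to [0,1]$ such that $F' \leq
\|w\|_\infty^{-1} (w \times 1_{\mathbb{T}})$ pointwise.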

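Proof. Let $F' = \min(F, \|w\|_\infty^{-1}(w \times 1_{\mathbb{T}}))$, let $f'_{\mathrm{tor}}(n) = F'(n \ (\mathrm{mod}\ q), n/N, \theta n)$, and
let $h = f_{\mathrm{tor}} - f'_{\mathrm{tor}}$. It suffices to prove $\|h\|_{\ell^2(N)} \leq \frac12 \delta^2$.
Now substituting in the definition of $h$ we have

$$\|h\|_{\ell^2(N)}^2 = \mathbb{E}_{n \leq N} h(n) \left( f_{\mathrm{tor}}(n) - \|w\|_\infty^{-1} (w \times 1_{\mathbb{T}})(n \ (\mathrm{mod}\ Q), n/N, \theta n) \right).$$

Recalling that $1_{A'} = f_{\mathrm{tor}} + f_{\mathrm{sml}} + f_{\mathrm{unf}}$ and that

$$1_A(n) = \|w\|_\infty^{-1} (w \times 1_{\mathbb{T}})(n \ (\mathrm{mod}\ Q), n/N, \theta n) + g_{\mathrm{unf}}(n),$$

we may rewrite this as

$$\|h\|_{\ell^2(N)}^2 = \mathbb{E}_{n \leq N} h(n) \left( -(1_A(n) - 1_{A'}(n)) - f_{\mathrm{sml}}(n) + (-f_{\mathrm{unf}}(n) + g_{\mathrm{unf}}(n)) \right).$$

To estimate this, we split into three terms as suggested by the bracketing. The
first term is $\leq 0$ since $h \geq 0$ and $1_{A'} \leq 1_A$ pointwise. The second term may be
estimated by the Cauchy-Schwarz inequality, remembering that $h \leq 1$ pointwise:

$$|\mathbb{E}_{n \leq N} h(n) f_{\mathrm{sml}}(n)| \leq \|f_{\mathrm{sml}}\|_{\ell^2(N)} \|h\|_{\ell^2(N)} \leq \|f_{\mathrm{sml}}\|_{\ell^2(N)} \leq \tfrac{1}{16} \delta^4.$$

Finally, the third term is extremely tiny if $\mathcal{F}$ grows quickly enough, by Lemma A.9.
Putting all this together gives $\|h\|_{\ell^2(N)}^2 \leq \frac14 \delta^4$ and the lemma follows. □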
Relabelling $f'_{\mathrm{tor}}$ as $f_{\mathrm{tor}}$, $f'_{\mathrm{sml}}$ as $f_{\mathrm{sml}}$ and $F'$ as $F$, we may thus assume that (3.6)
and (3.7) hold with $\|f_{\mathrm{sml}}\|_{\ell^2(N)} \leq \delta^2$, $\|f_{\mathrm{unf}}\|_{U^2(N)} \leq 1/\mathcal{F}(M)$ and some $O_\varepsilon(M)$-Lipschitz
function $F : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times (\mathbb{R}/\mathbb{Z})^d \to [0,1]$ such that $F \leq \|w\|_\infty^{-1} (w \times 1_{\mathbb{T}})$
pointwise, so in other words

$$F = \|w\|_\infty^{-1} \Psi \cdot (w \times 1_{\mathbb{T}})$$

for some continuous $\Psi : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times \mathbb{T} \to [0,1]$. By combining this decomposition
with Proposition 3.1 and the counting lemmata in the appendix we can finish the
proof of Theorem 3.3.

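Using (3.6), Cauchy-Schwarz and Lemma A.8 we have

$$\frac{|A'|}{N} = \mathbb{E}_{n \leq N} F(n \ (\mathrm{mod}\ q), n/N, \theta n) + \mathbb{E}_{n \leq N} f_{\mathrm{sml}}(n) + \mathbb{E}_{n \leq N} f_{\mathrm{unf}}(n) \leq \mathbb{E}_{n \leq N} F(n \ (\mathrm{mod}\ q), n/N, \theta n) + 2\delta.$$

Thus by the Lipschitz property of $F$, the irrationality of $\theta$ and Lemma A.4 we have

$$\frac{|A'|}{N} \leq \int F \, d\mu + 3\delta = \frac{1}{\|w\|_\infty} \int \Psi \cdot (w \times 1_{\mathbb{T}}) \, d\mu + 3\delta.$$
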

If $N \geq N_0(\varepsilon)$ is large enough then (3.4) implies that

\ A \ > 1 , 



JV ||w||oo 

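Assuming that $|A'| > (\frac13 + 2\varepsilon)|A|$, and recalling that $\delta$ was chosen so that $\delta \leq
\varepsilon / 4c_1'(\varepsilon)$, it follows from these two observations that

$$\int \Psi \cdot (w \times 1_{\mathbb{T}}) \, d\mu \geq \tfrac13 + \varepsilon.$$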

Proposition 3.1 now implies $T(\Psi) \geq c_2(\varepsilon)$, so by the pointwise bounds $c_1(\varepsilon) \leq w \leq
c_1'(\varepsilon)$ we have

$$T(F) = T\left( \|w\|_\infty^{-1} \Psi \cdot (w \times 1_{\mathbb{T}}) \right) \geq \frac{c_2(\varepsilon) c_1(\varepsilon)^3}{c_1'(\varepsilon)^3}.$$

Write $c_3(\varepsilon)$ for this latter quantity. By Lemma A.5 it follows that (if $\mathcal{F}$ grows
sufficiently rapidly and $N$ is big enough),

$$T(f_{\mathrm{tor}}) \geq T(F) - \tfrac12 c_3(\varepsilon) \geq \tfrac12 c_3(\varepsilon).$$

Finally, from Lemma A.10 together with the bounds $\|f_{\mathrm{sml}}\|_{\ell^2(N)} \leq \delta^2$, $\|f_{\mathrm{unf}}\|_{U^2(N)} \leq
1/\mathcal{F}(M)$ and the choice of $\delta$, we conclude that

$$T(A') = T(f_{\mathrm{tor}} + f_{\mathrm{sml}} + f_{\mathrm{unf}}) \geq \tfrac14 c_3(\varepsilon).$$

In particular $A'$ has $\gg_\varepsilon N^2$ solutions to $x + y = z$ with $x \neq y$. This concludes the
proof of Theorem 3.3 and hence of Theorem 1.1 (modulo the results of the next
two sections and the appendix). □

4. Sets of doubling less than 4

In this section we study sets $A$ satisfying $|A - A| \leq (4 - \varepsilon)|A|$ or various related
but slightly weaker conditions. Our particular aim is to prove Corollary 4.2 below,
which will be crucial in the construction of the weight function $w$ in Proposition
3.1. However, some special cases and corollaries of our main result may be of
independent interest and we highlight these in Section 6.

We remind the reader of our convention, mentioned at the beginning of Section
3, that the "natural" uniform measure on a space $X$, whether it be $\mathbb{Z}/q\mathbb{Z} \times [0,1]$,
$\mathbb{T}$, $\{1, \dots, N\}$, etc., is denoted $\mu$. When we want to refer to the size (i.e., counting
measure) of a set $A$, particularly a subset $A \subset \{1, \dots, N\}$, we will use the notation
$|A|$.

If $X$ is a space endowed with a measure $\mu$ (one of the above) then, as usual,
we define the convolution of two sufficiently nice functions $f_1, f_2 : X \to \mathbb{C}$ by
$f_1 * f_2(x) = \int f_1(y) f_2(x - y) \, d\mu(y)$. In the case $X = \{1, \dots, N\}$ we allow $f_1$, $f_2$ and
$f_1 * f_2$ to be defined on $\mathbb{Z} \setminus \{1, \dots, N\}$ as well, but we continue to use the measure
$\mu$ which gives each point a mass $1/N$. If $A$ is a set and $t$ is a real number then we
define $D_t(A) = \{x : 1_A * 1_{-A}(x) \geq t\}$, the set of "$t$-popular differences" of $A$. Note
that $D_t(A) \subset A - A$ if $t > 0$.
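
For a finite set of integers the popular differences can be computed by counting representations. The sketch below is a minimal illustration under the normalisation just described (each point of $\{1, \dots, N\}$ has mass $1/N$, so $1_A * 1_{-A}(x)$ is $1/N$ times the number of pairs $(a, b) \in A^2$ with $a - b = x$); the function name and the sample set are hypothetical, and it assumes $t > 0$.

```python
from collections import Counter
from fractions import Fraction


def popular_differences(A, N, t):
    """D_t(A) for A inside {1, ..., N} and t > 0.

    counts[x] is the number of pairs (a, b) in A x A with a - b = x, so
    counts[x] / N equals 1_A * 1_{-A}(x) for the measure of mass 1/N per point.
    """
    counts = Counter(a - b for a in A for b in A)
    return {x for x, c in counts.items() if Fraction(c, N) >= t}


A = {1, 2, 3}
N = 10
difference_set = {a - b for a in A for b in A}
D = popular_differences(A, N, Fraction(3, 20))
```

Here the differences $0, \pm 1, \pm 2$ have convolution values $\frac{3}{10}, \frac{2}{10}, \frac{1}{10}$ respectively, so the threshold $\frac{3}{20}$ keeps only $0$ and $\pm 1$.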

The main result of this section is the following.

Theorem 4.1. For every $\varepsilon > 0$ there is some $\delta \gg_\varepsilon 1$ such that the following holds.
If $A \subset \{1, \dots, N\}$ is a set with $|D_\delta(A)| \leq 4|A| - \varepsilon N$ then there is an arithmetic
progression $P \subset \{1, \dots, N\}$ of length $|P| \gg_\varepsilon N$ such that $|A \cap P| \geq (\frac12 + [...]\varepsilon)|P|$.

The reader may find it helpful to think of the hypothesis $|D_\delta(A)| \leq 4|A| - \varepsilon N$
as a slight weakening of $|A - A| \leq 4|A| - \varepsilon N$. To motivate this theorem, we first
derive the corollary which will enable us in Section 5 to construct a weight function
satisfying Proposition 3.1.

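Corollary 4.2. Let $\varepsilon > 0$ and $q \in \mathbb{N}$. Then there is $\delta \gg_\varepsilon 1$ such that if $A \subset
\mathbb{Z}/q\mathbb{Z} \times [0,1]$ is an open set with $\mu(D_\delta(A)) \leq 4\mu(A) - \varepsilon$ then there is a subgroup
$H \leq \mathbb{Z}/q\mathbb{Z}$ of index $[\mathbb{Z}/q\mathbb{Z} : H] \ll_\varepsilon 1$, an element $x \in \mathbb{Z}/q\mathbb{Z}$, and a subinterval $I$ of
$[0,1]$ of length $\mu(I) \gg_\varepsilon 1$ such that $A$ has density at least $\frac12 + [...]\varepsilon$ on $(x + H) \times I$.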

Proof. Let $A \subset \mathbb{Z}/q\mathbb{Z} \times [0,1]$ be an open set such that $\mu(D_\delta(A)) \leq 4\mu(A) - \varepsilon$. Then
for some positive integer $K$ depending on $\varepsilon$ and $A$ there is a subset $A' \subset A$, a union
of sets of the form $\{a\} \times (\frac{j}{K}, \frac{j+1}{K})$, such that $\mu(A') \geq \mu(A) - \frac18 \varepsilon$. (Note that none
of our final quantities can or will depend on $K$.) Then since $A' \subset A$,

$$\mu(D_\delta(A')) \leq \mu(D_\delta(A)) \leq 4\mu(A) - \varepsilon \leq 4\mu(A') - \tfrac12 \varepsilon.$$

With an abuse of notation rename $A'$ simply $A$.

For $N$ a large multiple of $q$, consider the map $\pi : \{1, \dots, N\} \to \mathbb{Z}/q\mathbb{Z} \times [0,1]$
defined by $\pi(n) = (n \ (\mathrm{mod}\ q), n/N)$. It is clear (see Lemma A.6) that for large
$N$ the image of $\{1, \dots, N\}$ under $\pi$ is highly equidistributed in $\mathbb{Z}/q\mathbb{Z} \times [0,1]$. In
particular we have

$$\mathbb{E}_{n \leq N} \psi(\pi(n)) = \int_{\mathbb{Z}/q\mathbb{Z} \times [0,1]} \psi(x) \, d\mu(x) + o_{K; N \to \infty}(1)$$ (4.1)

whenever $\psi$ is "nice", in particular whenever $\psi$ has one of the following three forms:

(i) the characteristic function of a union of sets $\{a\} \times (\frac{j}{K}, \frac{j+1}{K})$,

(ii) the characteristic function of the intersection of a set of type (i) with a
translate of another set of type (i),

(iii) a continuous function with Lipschitz constant $K$.

(Note that, conditional on one of these hypotheses, the quantity $o_{K; N \to \infty}(1)$ is
asserted to be independent of $\psi$.)

In particular, if $B = \pi^{-1}(A)$, by case (i) of (4.1) we have¹

$$\mu(B) = \mathbb{E}_{n \leq N} 1_A(\pi(n)) = \mu(A) + o_{K; N \to \infty}(1).$$ (4.2)

Furthermore we claim that

$$\mu(D_{2\delta}(B)) \leq \mu(D_\delta(A)) + o_{K, \delta; N \to \infty}(1).$$ (4.3)

This is a little trickier to justify. First note by case (ii) of (4.1) that

$$1_B * 1_{-B}(n) = \mathbb{E}_{m \leq N} 1_A(\pi(m)) 1_A(\pi(m) - \pi(n))$$
$$= \int_{\mathbb{Z}/q\mathbb{Z} \times [0,1]} 1_A(x) 1_A(x - \pi(n)) \, d\mu(x) + o_{K; N \to \infty}(1)$$
$$= 1_A * 1_{-A}(\pi(n)) + o_{K; N \to \infty}(1).$$

In particular if $N \geq N_0(K, \delta)$ is large enough then if $n \in D_{2\delta}(B)$ then $\pi(n) \in
D_{3\delta/2}(A)$, that is to say if $1_B * 1_{-B}(n) \geq 2\delta$ then $1_A * 1_{-A}(\pi(n)) \geq 3\delta/2$. Now
let $\chi : [0,1] \to [0,1]$ be a function such that $\chi(x) = 1$ for $x \geq 3\delta/2$, $\chi(x) = 0$ for
$x \leq \delta$, and $\chi$ has Lipschitz constant $O(1/\delta)$. What we have shown implies that if
$N \geq N_0(K, \delta)$ then

$$\mathbb{E}_{n \leq N} \chi \circ (1_A * 1_{-A})(\pi(n)) \geq \mu(D_{2\delta}(B)).$$

¹ Note that this would not be true if $A$ were an arbitrary open set, for example if $A$ were a set of
small measure containing $\mathbb{Z}/q\mathbb{Z} \times (\mathbb{Q} \cap [0,1])$.
Now $1_A * 1_{-A}$ has Lipschitz constant at most $K$, so $\chi \circ (1_A * 1_{-A})$ has Lipschitz
constant at most $O(K/\delta)$. Thus by case (iii) of (4.1),

$$\mathbb{E}_{n \leq N} \chi \circ (1_A * 1_{-A})(\pi(n)) = \int_{\mathbb{Z}/q\mathbb{Z} \times [0,1]} \chi \circ (1_A * 1_{-A})(x) \, d\mu(x) + o_{K, \delta; N \to \infty}(1)$$
$$\leq \mu(D_\delta(A)) + o_{K, \delta; N \to \infty}(1).$$

This completes the justification of the claim (4.3).

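Comparing (4.2) and (4.3) and recalling the hypothesis that $\mu(D_\delta(A)) \leq 4\mu(A) -
\frac12 \varepsilon$, we see that if $N \geq N_0(K, \varepsilon, \delta)$ is large enough then $|D_{2\delta}(B)| \leq 4|B| - \frac14 \varepsilon N$.
Choose $\delta \gg_\varepsilon 1$ small enough that Theorem 4.1 holds with $2\delta$ in place of $\delta$ and $\frac14 \varepsilon$
in place of $\varepsilon$. Then there is a progression $P \subset \{1, \dots, N\}$ of length $L = |P| \gg_\varepsilon N$,
say $P = \{x_0 + \lambda d : \lambda = 0, 1, \dots, L - 1\}$, such that $|B \cap P| \geq (\frac12 + [...]\varepsilon)|P|$.

It is readily seen that the image $\pi(P)$ is highly equidistributed (as $N \to \infty$) on
$\pi(x_0) + H \times I$, where $H \leq \mathbb{Z}/q\mathbb{Z}$ is the subgroup of index $\gcd(q, d) \leq d \ll_\varepsilon 1$, and
$I = [0, \frac{Ld}{N}]$ has length $\frac{Ld}{N} \gg_\varepsilon 1$, so by a variant of (4.1), case (i), we have

$$\frac{|B \cap P|}{|P|} \approx \frac{\mu(A \cap (\pi(x_0) + H \times I))}{\mu(\pi(x_0) + H \times I)}.$$

Therefore, if $N$ is large enough depending on $\varepsilon$ and $K$,

$$\mu(A \cap (\pi(x_0) + H \times I)) \geq (\tfrac12 + [...]\varepsilon) \mu(\pi(x_0) + H \times I).$$ □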



We devote the rest of this section to the proof of Theorem 4.1. The argument uses
several nontrivial ingredients: the arithmetic regularity lemma (Lemma A.2) again,
a "stability" version of Kemperman's theorem due to Tao [Taoa], [Taob, Section 3.2]
and the Brunn-Minkowski theorem. We begin with the regularity lemma. Let the
hypotheses be as in Theorem 4.1, thus $A \subset \{1, \dots, N\}$ is a set with $|D_\delta(A)| \leq
4|A| - \varepsilon N$. Let $\tilde{\mathcal{F}} : \mathbb{N} \to \mathbb{R}_+$ be a growth function depending on $\varepsilon$ to be chosen
later. Let e = min (e, y^)- Then there is some M <§C £ ,jf 1 such that 

lA = /tor + ,/sml + /unf, 

where \\f S jnl\\e 2 (N) < e 10 , ||/ unf \\u'(n) < V-^W and 

/tor = F(n (mod g), n/iV, On) 

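for some $F : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times (\mathbb{R}/\mathbb{Z})^d \to [0,1]$ such that $q, d, \|F\|_{\mathrm{Lip}} \leq M$ and for
some $(\tilde{\mathcal{F}}(M), N)$-irrational $\theta \in (\mathbb{R}/\mathbb{Z})^d$. As usual we abbreviate $(\mathbb{R}/\mathbb{Z})^d$ to $\mathbb{T}$.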

Let M = \e~ 10 M~\ and consider, for a £ Z/gZ and i S {1, . . . , M}, the progres- 
sions 



(i - iiV 
M ' ~M 



n = a (mod q) 
Define $F_{a,i} : \mathbb{T} \to [0,1]$ by $F_{a,i}(x) = F(a, i/\tilde{M}, x)$. Then because $F$ is $M$-Lipschitz,
F a} i is M-Lipschitz and / tor differs by at most e 10 from a function /struct which we 
define by 

$$f_{\mathrm{struct}}(n) = \sum_{a \ (\mathrm{mod}\ q)} \sum_{i=1}^{\tilde{M}} 1_{I_{a,i}}(n) F_{a,i}(\theta n).$$

Absorbing the error of e 10 into / sm i, we have a decomposition 

1a = /struct + /sml + /unf 

where ||/ s ' m ill^(A) < 2e 10 and [|/ unf \\u'(n) < 1/.F(M). Now given an arbitrary 
growth function J 7 depending on e, we may choose J- to grow sufficiently rapidly 
depending on e so that F{M) ^ F{M), whence ||/unf ||c/ 2 (w) ^ V-^C^O an ^ is 
(F(M), iV)-irrational. Clearly we may now rename M as M, / s ' ml as / sm i and T as 
J 7 , so that 

$$1_A = f_{\mathrm{struct}} + f_{\mathrm{sml}} + f_{\mathrm{unf}},$$ (4.4)

where \\f s ^\\p{N) < 2e 10 , ||/ un f ||c/ 2(w) < 1/F{M), 

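$$f_{\mathrm{struct}}(n) = \sum_{a \ (\mathrm{mod}\ q)} \sum_{i=1}^{M} 1_{I_{a,i}}(n) F_{a,i}(\theta n),$$

and

$$I_{a,i} = \left\{ n \in \{1, \dots, N\} : \frac{(i-1)N}{M} < n \leq \frac{iN}{M}, \ n \equiv a \ (\mathrm{mod}\ q) \right\}.$$
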

Write $\alpha(a,i)$ for the density of $A$ on $I_{a,i}$. We will show that $\alpha(a,i) \geq \frac12 + [...]\varepsilon$
for some $(a,i)$. Note that while $|I_{a,i}|$ need not be exactly $N/qM$, at worst it differs
from $N/qM$ by 2. We will deal with this small discrepancy by taking $N \geq N_0(\varepsilon)$
sufficiently large depending on $\varepsilon$. This is acceptable: if $N < N_0(\varepsilon)$ then Theorem
4.1 is trivially satisfied by taking $P$ to be a suitable singleton.²

We proceed by examining how the behaviour of $1_A$ is modelled by the more
"structured" functions $F_{a,i}(\theta n)$, which in view of the decomposition (4.4) involves
estimating the effect of $f_{\mathrm{sml}}$ and $f_{\mathrm{unf}}$. The term $f_{\mathrm{sml}}$ is the more troublesome of
the two. The following simple lemma is useful here.

Lemma 4.3. For all $(a,i) \in \mathbb{Z}/q\mathbb{Z} \times \{1, \dots, M\}$ outside an exceptional subset $E$ of
size at most e A qM we have E„ e j i |/ sm i("-)| *S e 5 - 

Proof. If this were not the case we would have 

1 / N 



E„<jv|/ sm i(n)| > — ^— - 2 ) qMe J ^ 2i l 



² Alternatively, one could arrange that $N$ is always a multiple of $qM$, in which case $|I_{a,i}|$ is exactly
$N/qM$.
whence by Cauchy-Schwarz ||/sml|U 2 (An > 2e 10 , a contradiction. □

Lemma 4.4. Let $E$ be as in the preceding lemma. For all $(a,i) \in \mathbb{Z}/q\mathbb{Z} \times \{1, \dots, M\}$
outside E we have J T F Q) j a(a, i) — e 4 . 



Proof. By Lemrna [A.8l the average of / un f over any progression I a ^ is less than if 4 
provided that F grows sufficiently rapidly, and by Lemma 1431 for all (a,i) ^ E the 
average of / sm i on I a ,i is also at most |e 4 . Thus if (a,i) ^ E we have 

a(o,t) = E„ e/a t l A (n) ^ E„ eJa 4 F o ,<(0n) + |e 4 < / F„,< +e 4 . 

JT 

Assuming $\mathcal{F}$ grows sufficiently rapidly, the last step follows from the $(\mathcal{F}(M), N)$-irrationality
of $\theta$ and Lemma A.3. □

We need a slightly technical lemma concerning level sets of Lipschitz functions.

Lemma 4.5. Let $\eta > 0$. If $\mathcal{F}$ grows sufficiently quickly depending on $\eta$ then the
following is true. If $F : \mathbb{T} \to [0,1]$ is $M$-Lipschitz, $\theta$ is $(\mathcal{F}(M), N)$-irrational and
$I \subset \{1, \dots, N\}$ is any progression of length at least $N/M^2$, then the proportion of
$n \in I$ such that $F(n\theta) > \eta$ is at least $\mu(\{x \in \mathbb{T} : F(x) > 2\eta\}) - \eta$.

Proof. We want to compute $\mathbb{E}_{n \in I} \chi \circ F(n\theta)$, where $\chi$ is the cutoff $1_{x > \eta}$. Replace
$\chi$ by a function $\tilde{\chi}$ with $\|\tilde{\chi}\|_{\mathrm{Lip}} \ll 1/\eta$ such that $\tilde{\chi}(x) = 0$ for $x \leq \eta$ and $\tilde{\chi}(x) = 1$
for $x \geq 2\eta$. Then $\mathbb{E}_{n \in I} \chi \circ F(n\theta) \geq \mathbb{E}_{n \in I} \tilde{\chi} \circ F(n\theta)$. However the function $\tilde{\chi} \circ F$ is
Lipschitz with $\|\tilde{\chi} \circ F\|_{\mathrm{Lip}} \ll M/\eta$ and so, if $\mathcal{F}$ grows sufficiently rapidly, since $\theta$ is
so irrational, Lemma A.3 implies that $\mathbb{E}_{n \in I} \tilde{\chi} \circ F(n\theta) \geq \int_{\mathbb{T}} \tilde{\chi} \circ F - \eta$. On the other
hand the integral here is at least the measure of $\{x : F(x) \geq 2\eta\}$. □

The following lemma has more meat to it and is a crucial ingredient of our argument.
It encodes the fact that if $B_1, B_2$ are open subsets of a torus then the measure
$\mu(B_1 + B_2)$ is at least $\min(\mu(B_1) + \mu(B_2), 1)$, a 1953 result due to Macbeath [Mac53].
More accurately, we require a "robust" version of this result which was obtained
in [GR05, Proposition 6.1], and recently given the following elegant formulation by
Tao [Taob]: if $S_1, S_2 \subset \mathbb{T}$ are open and $0 \leq t \leq \min(\mu(S_1), \mu(S_2))$ then

$$\int_{\mathbb{T}} \min(1_{S_1} * 1_{S_2}, t) \, d\mu \geq t \min(\mu(S_1) + \mu(S_2) - t, 1).$$ (4.5)

Lemma 4.6. Let $0 < \eta < 1$ and suppose that $F_1, F_2 : \mathbb{T} \to [0,1]$ are $M$-Lipschitz
functions such that $\int F_1, \int F_2 \geq 2\eta^{1/6}$. Then the measure of the set of $x$ for which
$F_1 * F_2(x) \geq \eta$ is at least $\min(\int F_1 + \int F_2, 1) - 4\eta^{1/6}$.

Proof. Let $S_i = \{x : F_i(x) > \eta^{1/3}\}$ for $i = 1, 2$. Clearly $\mu(S_i) \geq \int F_i - \eta^{1/3}$, so in
particular $\mu(S_1), \mu(S_2) \geq \eta^{1/6}$. By (4.5) we therefore have

$$\eta^{-1/6} \int_{\mathbb{T}} \min(1_{S_1} * 1_{S_2}(x), \eta^{1/6}) \, d\mu(x) \geq \min(\mu(S_1) + \mu(S_2) - \eta^{1/6}, 1) \geq \min\left(\int F_1 + \int F_2, 1\right) - 3\eta^{1/6}.$$

Writing $X$ for the set of $x \in \mathbb{T}$ such that $1_{S_1} * 1_{S_2}(x) \geq \eta^{1/3}$, the left-hand side here
is bounded by $\mu(X) + \eta^{1/6}$, so $\mu(X) \geq \min(\int F_1 + \int F_2, 1) - 4\eta^{1/6}$. On the other
hand, for $x \in X$ we certainly have $F_1 * F_2(x) \geq \eta^{2/3} 1_{S_1} * 1_{S_2}(x) \geq \eta$. □

Lemma 4.7. If (a,i), (a' ,i') ^ £7 and a(a,i), a(a',i') ^ 2e 2 iften 

AT 10e 2 iV 

|Dg3o /1QM 2(A) n/ -a',i-i' I > —min(a(a,i) + a(a,i),l) — , 

' qM qM 

and the same bound holds for \ / WM 2 (A) n I a -a' 

If $f$ is a function on an abelian group we write $f^\circ$ for the function $f^\circ(x) = f(-x)$.

Proof. Dealing with /„_„/ and / a -a' ,i— i'+i are similar, so we focus on the for- 
mer. By Lemma 14.41 then, it suffices to prove 

M 2 N 



D^o/iom 2 (-4) n I a -a',i-i' I > -^minf J F a!i + j F a >y,l 



qM 



for (o, i) and (a',i') outside £7 and such that J Fa,i> J Fa' .v £ 2 ■ 

For all except maybe 2e 2 N/qM values of d G I a -a',i-i' (those near the left ends), 

|/«,in(d + J oV )|>^, (4.6) 

and for any such d we have, if T is sufficiently rapidly growing, 

/structk,** /struct I (<9 = E F a ^{9n) £V,i' (9(d + 7l)) 

ne/a,.n(ii+/ a / i /) 
^ |7 a ,i n (d + W)| (F a , 4 * i^.i^fld) - \e 12 ) . (4.7) 

Here we used the ( J-{M), iV)-irrationality of 0, Lemma [A. 3 1 and the fact that the 
product of two M-Lipschitz functions, each of which is bounded pointwise by 1, is 
2M-Lipschitz. By Lemma IA.11| F a ,i * F°, v is also Af-Lipschitz, so again by the 
(T{M ), A^)-irrationality of 9 and by Lemma [4.51 the proportion of d G £ a -a',i-i' 
such that F at i * F° lv (6d) ^ \e 12 is at least fi(Y) - e 12 , where 

Y = {y:F a , i *F° a ,j{y)>s 12 }. 

But by Lemma [4~6l with 77 = e 12 , (j,(Y) ^ min (/ F a<i + J F a , , v , l) - 4e 2 . Putting 
this all together, 

e 14 N 

/struct)/*.* * fatruct\l a *yW ^ — — 
for a set of d e I a -a' i-i' 01 size at least



qM \J ' J ' ' / qM 

Now by Lemma 14.31 and Young's inequality (Lemma IA.12j) we can absorb the 
contribution of / sm i and conclude that 



(/struct + feml)\j a> i * (/struct + /sml)° , ., W 



5qM 

for these same values of d. Finally we add in the contribution of / un f- Recalling 
from (|4.4j) that I a = /struct + Ami + /unf, Lemma [OH implies that 

e 14 N 

for all d in a subset I a -a'.i-i' of size at least 

N ( f f \ 8e 2 N 

min / F a . z + / F a i m, 1 



qM V/ 7 / 

All these <i lie in D,?i4 /8 9 m(^), which is of course contained in D^2o /iom 2 (^)- ^ 

To use the bound supplied by the preceding lemma we apply the Brunn-Minkowski
theorem, which states that if $X, Y \subset \mathbb{R}^d$ are open then $\mu(X + Y)^{1/d} \geq \mu(X)^{1/d} +
\mu(Y)^{1/d}$. We require the case $d = 2$. For a wider discussion and proof, see [Gar02].
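
The case $d = 2$ can be sanity-checked on axis-parallel open rectangles, where the Minkowski sum is again a rectangle whose side lengths add. The sketch below is only such a check, with hypothetical rectangle dimensions; it is not a proof and not part of the paper's argument.

```python
import math


def area(rect):
    """Area of an axis-parallel rectangle given as (width, height)."""
    width, height = rect
    return width * height


def minkowski_sum(rect1, rect2):
    """Minkowski sum of two axis-parallel rectangles: side lengths add."""
    return (rect1[0] + rect2[0], rect1[1] + rect2[1])


# A generic pair: the inequality is strict.
X, Y = (4.0, 1.0), (1.0, 9.0)
lhs = math.sqrt(area(minkowski_sum(X, Y)))
rhs = math.sqrt(area(X)) + math.sqrt(area(Y))

# A homothetic pair (Y2 is X2 scaled by 2): equality holds.
X2, Y2 = (2.0, 1.0), (4.0, 2.0)
lhs_eq = math.sqrt(area(minkowski_sum(X2, Y2)))
rhs_eq = math.sqrt(area(X2)) + math.sqrt(area(Y2))
```

Taking $Y = -X'$ with both sets of equal area gives $\mu(X - X') \geq 4\mu(X)$, which is the form in which the inequality is used below.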

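Lemma 4.8. Given a function $\alpha : \mathbb{Z}/q\mathbb{Z} \times \{1, \dots, M\} \to [0, 1]$ and $(x, y) \in \mathbb{Z}/q\mathbb{Z} \times \{-M, \dots, M\}$,
define $\tilde\alpha(x,y) = \max(\alpha(a,i) + \alpha(a',i'))$, where the maximum is
taken over all $(a,i), (a',i') \in \mathbb{Z}/q\mathbb{Z} \times \{1,\dots,M\}$ such that $\alpha(a,i), \alpha(a',i') > 0$,
$a - a' = x$ and either $i - i' = y$ or $i - i' + 1 = y$. Then

$$\sum_{x,y} \tilde\alpha(x,y) \geq 4 \sum_{a,i} \alpha(a,i).$$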



Proof. Consider the open sets $X, X' \subset \mathbb{Z}/q\mathbb{Z} \times \mathbb{R}^2$ defined by

$$X = \bigcup_{(a,i) \in \mathbb{Z}/q\mathbb{Z} \times \{1,\dots,M\}} \{a\} \times (i-1, i) \times (0, \alpha(a,i)),$$

$$X' = \bigcup_{(a',i') \in \mathbb{Z}/q\mathbb{Z} \times \{1,\dots,M\}} \{a'\} \times (i'-1, i') \times (-\alpha(a',i'), 0).$$

Note that

$$X - X' = \bigcup_{(a,i),(a',i')} \{a - a'\} \times (i - i' - 1, i - i' + 1) \times (0, \alpha(a,i) + \alpha(a',i'))$$

$$= \bigcup_{(x,y) \in \mathbb{Z}/q\mathbb{Z} \times \{-M,\dots,M\}} \{x\} \times (y-1, y) \times (0, \tilde\alpha(x,y)),$$

where in the last equality we have ignored a set of measure zero. Thus, if $\nu$ is the
product of counting measure on $\mathbb{Z}/q\mathbb{Z}$ and Lebesgue measure $\lambda$ on $\mathbb{R}^2$, we have
$\nu(X) = \nu(X') = \sum_{a,i} \alpha(a,i)$ and $\nu(X - X') = \sum_{x,y} \tilde\alpha(x,y)$. It therefore suffices to
show that

$$\nu(X - X') \geq 4\nu(X).$$

The case $q = 1$ of this is immediate from the Brunn-Minkowski inequality. A
simple argument allows us to extend this to general $q$. Indeed, let $X_a, X'_a$ be the
fibres of $X, X'$ respectively above $a \in \mathbb{Z}/q\mathbb{Z}$. Then $X_a, X'_a$ are open subsets of
$\mathbb{R}^2$. Pick $a_*$ such that $\lambda(X_{a_*}) = \lambda(X'_{a_*}) = \sum_i \alpha(a_*, i)$ is largest. If $X_a \neq \emptyset$ then the
Brunn-Minkowski inequality implies that

$$\lambda(X_a - X'_{a_*}) \geq \big(\lambda(X_a)^{1/2} + \lambda(X'_{a_*})^{1/2}\big)^2 \geq 4\lambda(X_a).$$

However the sets $X_a - X'_{a_*}$ are disjoint as $a$ ranges over $\mathbb{Z}/q\mathbb{Z}$, since each lies in a
different fibre over $\mathbb{Z}/q\mathbb{Z}$. Therefore

$$\nu(X - X') \geq \sum_{a : X_a \neq \emptyset} \lambda(X_a - X'_{a_*}) \geq \sum_{a : X_a \neq \emptyset} 4\lambda(X_a) = 4\nu(X). \qquad \square$$
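The inequality of Lemma 4.8 is easy to test by brute force on small random instances. The following is only an illustrative sketch: the function names are hypothetical, and pairs $(x, y)$ admitting no admissible pair $(a,i), (a',i')$ are assumed to contribute 0 to the sum.

```python
import itertools
import random

def alpha_tilde(alpha, q):
    """alpha maps (a, i) -> value in [0, 1], with a in range(q), i in 1..M.
    Returns a dict (x, y) -> max of alpha(a,i) + alpha(a',i') over pairs with
    both values > 0, a - a' = x (mod q) and y in {i - i', i - i' + 1}.
    Pairs (x, y) with no admissible pair are absent (assumed to count as 0)."""
    support = [key for key, value in alpha.items() if value > 0]
    result = {}
    for (a, i), (b, j) in itertools.product(support, repeat=2):
        x = (a - b) % q
        total = alpha[(a, i)] + alpha[(b, j)]
        for y in (i - j, i - j + 1):
            if total > result.get((x, y), 0.0):
                result[(x, y)] = total
    return result

def lemma_4_8_gap(alpha, q):
    """sum_{x,y} alpha_tilde(x,y) - 4 * sum_{a,i} alpha(a,i); should be >= 0."""
    return sum(alpha_tilde(alpha, q).values()) - 4 * sum(alpha.values())

# Hypothetical random instances with small q and M.
random.seed(1)
gaps = []
for _ in range(200):
    q, M = random.randint(1, 4), random.randint(1, 4)
    alpha = {(a, i): random.choice([0.0, random.random()])
             for a in range(q) for i in range(1, M + 1)}
    gaps.append(lemma_4_8_gap(alpha, q))
```

The single-cell case $q = M = 1$ gives equality, matching the equality case of Brunn-Minkowski for $Y = X$.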

In fact we need the following more robust variant of the above, easily deduced 
from it. 

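Lemma 4.9. Let $\eta > 0$. Given a function $\alpha : \mathbb{Z}/q\mathbb{Z} \times \{1, \dots, M\} \to [0,1]$ and
$(x, y) \in \mathbb{Z}/q\mathbb{Z} \times \{-M, \dots, M\}$, define $\tilde\alpha(x, y) = \max(\alpha(a, i) + \alpha(a', i'))$, where the
maximum is taken over all $(a,i), (a',i') \in \mathbb{Z}/q\mathbb{Z} \times \{1, \dots, M\}$ such that $\alpha(a,i)$,
$\alpha(a', i') > \eta$, $a - a' = x$ and either $i - i' = y$ or $i - i' + 1 = y$. Then

$$\sum_{x,y} \tilde\alpha(x,y) \geq 4\sum_{a,i} \alpha(a,i) - 4\eta qM.$$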

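Proof. Let

$$\alpha^\dagger(a,i) = \begin{cases} \alpha(a,i) & \text{if } \alpha(a,i) > \eta, \\ 0 & \text{otherwise.} \end{cases}$$

Then if we define, as in Lemma 4.8, $\tilde\alpha^\dagger(x, y) = \max(\alpha^\dagger(a, i) + \alpha^\dagger(a', i'))$, where the
maximum is taken over all $(a, i), (a', i')$ such that $\alpha^\dagger(a, i), \alpha^\dagger(a', i') > 0$, $a - a' = x$
and either $i - i' = y$ or $i - i' + 1 = y$, then $\tilde\alpha^\dagger = \tilde\alpha$ as defined above. It follows then
from Lemma 4.8 that

$$\sum_{x,y} \tilde\alpha(x,y) = \sum_{x,y} \tilde\alpha^\dagger(x,y) \geq 4\sum_{a,i} \alpha^\dagger(a,i) \geq 4\sum_{a,i}\alpha(a,i) - 4\eta qM. \qquad \square$$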

Now we are ready to put everything together and complete the proof of Theorem
4.1. Let $\delta = \varepsilon^{20}/10M^2$. Then certainly $\delta \gg_\varepsilon 1$. Recall that $\alpha(a,i)$ is the density
of $A$ on $I_{a,i}$. Define

$$\alpha'(a,i) = \begin{cases} \alpha(a,i) & \text{if } (a,i) \notin E, \\ 0 & \text{if } (a,i) \in E. \end{cases}$$

Then Lemma 4.7 may be rephrased as follows: if $\alpha'(a,i), \alpha'(a',i') \geq 2\varepsilon^2$ then

N 10e 2 N 
| D S (A) n J _„/,(_i/ 1 > — xmn(a'(a, i) + a' {a', i'), 1) - 

with the same bound for $|D_\delta(A) \cap I_{a-a',i-i'+1}|$. It follows that

\B S (A)\ > -^^min(d'(a;,j/),l)-10e 2 iV, 

where $\tilde\alpha'$ is defined from $\alpha'$ as in Lemma 4.9 with $\eta = 2\varepsilon^2$. Recalling that
e = min(e, j^), this implies 

Supposing that a'(x,y) < 1 + |e for all (x,y), Lemma 1431 implies 

a,i a, i 

Thus if | Dj(A)| ^ A\A\ — eN, there must be some (x, y) such that a'(x,y) ^ 1 + 1 e, 
whence for some (a,i) we must have a(a,i) ^ 5 + g£. This completes the proof of 
Theorem 14.11 

5. Construction of the weight function 

In this section we prove Proposition 3.1 by constructing an appropriate weight
function $w$. The reader may wish to take this opportunity to recall the statement
of that result. A key ingredient in the proof is the following corollary of the results
of Section 4. It states that an "almost sum-free" open subset of $\mathbb{Z}/q\mathbb{Z} \times [0, 1]$
with density larger than $\tfrac{1}{3}$ must "avoid the origin". Recall that $\mu$ is the natural
probability measure on $\mathbb{Z}/q\mathbb{Z} \times [0, 1]$, namely the product of the uniform measure
on $\mathbb{Z}/q\mathbb{Z}$ and the Lebesgue measure.

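Corollary 5.1. Let $\varepsilon, \eta > 0$ and $q \in \mathbb{N}$. Suppose that $A \subset \mathbb{Z}/q\mathbb{Z} \times [0, 1]$ is an open
set with $\mu(A) \geq \tfrac{1}{3} + \varepsilon$ and $T(A) \leq \eta$. Then $\mu(A \cap (H \times I)) \ll_\varepsilon \eta$ for some subgroup
$H \leq \mathbb{Z}/q\mathbb{Z}$ of index $\ll_\varepsilon 1$ and some interval $I$ around 0 of length $\gg_\varepsilon 1$.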

Proof. We may assume that $\eta < \eta_0(\varepsilon)$, for some $\eta_0(\varepsilon)$ to be specified later. If not,
the corollary is trivial by taking $H = \mathbb{Z}/q\mathbb{Z}$ and $I = [0, 1]$. Let $\delta \gg_\varepsilon 1$ be as in
Corollary 4.2. Recall that $D_\delta(A) = \{x : 1_A * 1_{-A}(x) \geq \delta\}$ and first suppose that
$\mu(D_\delta(A)) > 4\mu(A) - \varepsilon$. Write $D_\delta(A)^+ = D_\delta(A) \cap [0, 1]$. Since $D_\delta(A)$ is symmetric
about 0, we have $\mu(D_\delta(A)^+) > 2\mu(A) - \tfrac{1}{2}\varepsilon$. It follows that $\mu(A) + \mu(D_\delta(A)^+) >
3\mu(A) - \tfrac{1}{2}\varepsilon > 1 + 2\varepsilon$, and so by the pigeonhole principle $\mu(A \cap D_\delta(A)^+) > 2\varepsilon$. This
implies that $T(A) \geq 2\varepsilon\delta \gg_\varepsilon 1$. If $\eta_0(\varepsilon)$ is small enough then this is more than $\eta$,
and the corollary is established in this case.

The other possibility is that fi(Ds(A)) ^ 4/i(A) —e. In this case, by Theorem l4.2l 
there is a subgroup H TLjql of index m <C e 1 and an interval 7 C [0, 1] of length 
£ ^ e 1 such that A has density at least \ + on (x + 77) x 7, for some x S TLjqL. 
Let e' = \el, and suppose that (h,t) E H x [0,e']. Then both A n ((x + 77) x 7) 
and (A n ((x + 77) x 7)) + (/i, t) lie in [x + 77) x 7', where I' = I + [0, e']. Noting 
that /j(77 x 7) = £/m and /j(77 x 7') = (£+ e')/m, we have 

/j(in(4+(/i,f))) 2 (i + ±e)m(77 x 7) - M (77 x 7') > — . 

' m 

It follows that T(A) ^ /j(A n (77 x [0,e']))^. Since T(A) < 77, this implies that 
fi(A n (77 x [0, e 7 ])) ^ nm/e' <C e 77, and we have proved the corollary in this case 
too. □ 



Using the above corollary, we can construct a weight function on $\mathbb{Z}/Q\mathbb{Z} \times [0, 1]$
for some $Q \ll_\varepsilon 1$, packing most of its weight near 0 in a certain sense, and prove
that it satisfies Proposition 3.1. Our iterative strategy¹ is embodied in the following
lemma.

Lemma 5.2. Let e > and suppose that a ^ ^ + g£. Suppose we are given a 
Lipschitz weight function w : TLjQTL x [0, 1] — > (0, 00) and 77 > such that if Q \ q 
and AcZ/gZx [0, 1] is an open set such that T(A) n then 

w d/i a. 

Then for some Q' there is a Lipschitz weight function w' : Z/Q'Z x [0, 1] — > (0, 00) 
and rf > such that if Q' \ q' and A C Z/q'Z x [0, 1] is an open set such that 
T(A) < rf then 

I w' d/i a' , 
ii 

where a' = |a + | ( ^ + |e) . 

Proof. Apply Corollary 15.11 with |e replacing e. Thus if A C Z/gZ x [0, 1] is open, 
H(A) >\ + \e and T(A) < 7/ then n (77 x [0,e'])) < e 77', where 77 < Z/QZ 
is a subgroup of index at most C e and e' 3> e 1. Let M = C e l. Then the index 
[Z/qZ : 77] necessarily divides M, so for every x E Z/qZ we have Mx E 77. 



¹This strategy was suggested to us by the proof of the contraction mapping theorem.

For $t \in [\tfrac{1}{2}, 1]$ and $q \in \mathbb{N}$ define $\pi_t : \mathbb{Z}/q\mathbb{Z} \times [0, 1] \to \mathbb{Z}/Mq\mathbb{Z} \times [0, 1]$ by $\pi_t(x, y) =$
(Mx,ts'y). Then, by the above, if A C Z/MqZ x [0, 1] is open, /j(A) ^ § + |e and 
T(A) < n' then /_t(A n im7r t ) < e 77'. 

Let $Q' = MQ$ and define $w'_t$ on $\mathbb{Z}/Q'\mathbb{Z} \times [0, 1]$ by



w' t (x) = ll im7rt (x) W( f 1{X I ) + f (5.1) 
/j(im7r t ) 



This definition can be made to look a little more natural as follows. Define measures
v on Z/QZ x [0,1] and v[ on Z/Q'Z x [0,1] by i/(A) = / l A wdn and 4(A) = 
/ ^Aw' t d/i. Then the relationship between v and 4 is v' t — ^(irt)*v + i/x, where the 
push-forward measure is defined as usual by Tr*i/(A) = i/(tt~ 1 (A)). 

Now suppose Q' \ q' ', say g' = where Q I <7j an d A C Z/q'Z x [0, 1] is open and 
T(A) ^ 77'. If /x(A) ^ I + g£ then, as noted above, /i(A n im7r 4 ) <C e 77'. Therefore 

(7T t ).i/(A) = ufo 1 (A)) < [MUM^r 1 ^)) = Moo ^P^^ « E r/IMU 

so 4(A) ^ O e (r7 / ||u'|| 00 ) + |, and this can be made to be less than | by taking 77' 
sufficiently small depending on e and ||t7j||oo- 

Suppose instead fi(A) < | + |e. If 77' ^ /j(im7ri/2) 3 77 then we have T(7r t _1 (A)) < 
77 for all t G [i, 1], so in this case we have 

4(A) = fi/fa" 1 ^)) + ± M (A) ^ |« + I (I + | £ ) = a '. 

To complete the proof, we must show that $w'$ can be chosen to be Lipschitz. In
fact $w'_t$ generally has a jump discontinuity² at every point of the form $(Mk, t\varepsilon')$,
but we can remedy this by defining

$$w' = 2\int_{1/2}^{1} w'_t \, dt. \qquad (5.2)$$

Then if $Q' \mid q'$, $A \subset \mathbb{Z}/q'\mathbb{Z} \times [0, 1]$ is open and $T(A) \leq \eta'$ then

$$\int_A w' \, d\mu = 2\int_{1/2}^{1} \nu'_t(A) \, dt \leq 2\int_{1/2}^{1} \alpha' \, dt = \alpha',$$

while from (5.1) and (5.2) it is fairly clear that $w'$ is Lipschitz. □
By applying Lemma 5.2 iteratively we get a weak version of Proposition 3.1.

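Proposition 5.3. Let $\varepsilon > 0$. Then there is an integer $Q$ and a Lipschitz weight
function $w : \mathbb{Z}/Q\mathbb{Z} \times [0, 1] \to (0, \infty)$ with the following property. If $Q \mid q$, then for
any open set $A \subset \mathbb{Z}/q\mathbb{Z} \times [0, 1]$ such that $\int 1_A w \, d\mu \geq \tfrac{1}{3} + \varepsilon$ we have $T(A) \gg_\varepsilon 1$.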



²This would not, in actual fact, be a fatal hole in our argument; in Section 3 we could instead have
dealt with the larger class of piecewise Lipschitz functions. This is a little complicated, however,
and the Lipschitz hypothesis is convenient.

Proof. Apply Lemma 5.2 iteratively starting with $Q = 1$, $w = 1$, $\alpha = 1$, and $\eta = 1$.
After $n = 100 \log(1/\varepsilon)$ steps we obtain an integer $Q$ and a weight $w$ on $\mathbb{Z}/Q\mathbb{Z} \times [0, 1]$
satisfying the hypotheses of that lemma with some $\eta > 0$ and

«-(I)" + i(i + ! + (!) 2 + --- + (tr 1 ) + <! + i* □ 

To obtain Proposition 3.1 from Proposition 5.3 we must replace $1_A$ by an arbitrary
continuous function $\Psi$, and we must introduce the additional factor of
$\mathbb{T} = (\mathbb{R}/\mathbb{Z})^d$. Both of these improvements turn out to be relatively straightforward.

Proof of Proposition 3.1. We will show that $w$, the weight function on $\mathbb{Z}/Q\mathbb{Z} \times [0, 1]$
constructed in Proposition 5.3, has the property required by Proposition 3.1. We do
this in stages, beginning with the following.

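Claim I. Consider the "discrete torus" $\mathbb{T}_{\mathrm{disc}} = \mathbb{Z}/q_1\mathbb{Z} \times \cdots \times \mathbb{Z}/q_d\mathbb{Z}$, where
$q_1, \dots, q_d > q$ are distinct primes. Suppose $A \subset \mathbb{Z}/q\mathbb{Z} \times [0, 1] \times \mathbb{T}_{\mathrm{disc}}$ is open and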
/ l A (w x l Tdlsc )d/i ^ | + \e. Then T(A) > £ 1. (Here, Z/QZ x [0,1] x T disc is 
endowed with the uniform probability measure /j,.) 

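Proof of Claim I. Let $q' = q q_1 \cdots q_d$ and consider the $\mu$-preserving isomorphism

$$\psi : \mathbb{Z}/q'\mathbb{Z} \times [0, 1] \to \mathbb{Z}/q\mathbb{Z} \times [0, 1] \times \mathbb{T}_{\mathrm{disc}}$$

given by

$$\psi(x,y) = (x \ (\mathrm{mod}\ q),\ y,\ x \ (\mathrm{mod}\ q_1),\ \dots,\ x \ (\mathrm{mod}\ q_d)).$$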

If A C Z/gZ x [0,1] x Tdisc is open and J 1a(w x l Tdiac )c?^ ^ | + |e then 
/ l r i (A) KJ(i/i ^ | + js, so Proposition IQ1 implies T(A) = T(ifj^ 1 (Aj) > e 1. 
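The map $\psi$ above is the Chinese remainder isomorphism in the first coordinate and the identity on $[0,1]$. As a small illustration (the function names are hypothetical and only the discrete coordinates are modelled), one can confirm by brute force that the discrete part is a bijective group homomorphism when $q_1, \dots, q_d > q$ are distinct primes, and that it fails to be one when a modulus shares a factor with $q$.

```python
import itertools

def psi_discrete(x, q, primes):
    """Discrete coordinates of psi: x mod q' -> (x mod q, x mod q_1, ..., x mod q_d)."""
    return (x % q,) + tuple(x % p for p in primes)

def is_group_isomorphism(q, primes):
    """Brute-force check that psi_discrete is bijective and additive on Z/q'Z,
    where q' = q * q_1 * ... * q_d."""
    q_prime = q
    for p in primes:
        q_prime *= p
    moduli = (q,) + tuple(primes)
    images = [psi_discrete(x, q, primes) for x in range(q_prime)]
    bijective = len(set(images)) == q_prime
    additive = all(
        psi_discrete((x + y) % q_prime, q, primes)
        == tuple((u + v) % m for u, v, m in zip(images[x], images[y], moduli))
        for x, y in itertools.product(range(q_prime), repeat=2)
    )
    return bijective and additive
```

Since $\psi$ is a bijection between finite groups of the same size, it carries the uniform measure to the uniform measure, which is the measure-preservation used in the proof.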

Claim II. The same claim holds if Tdi sc is replaced by the genuine torus T = 
(R/Z) d and \e is replaced with \e. That is, if A C Z/QZ x [0, 1] x T is open and 
/ l A (w x 1 T ) ^ i + \e then T(A) > £ 1. 

Proof of Claim II. This is a standard discretisation argument. We may find
some $k > 0$ and a subset $A' \subset A$ such that $A'$ is a disjoint union of sets of the form
$\{x\} \times I \times J$, where $I \subset [0, 1]$ is an open interval of length $k$ and $J \subset \mathbb{T}$ is an open
cube of side-length k, and J 1a> (w x It) ^ | + |e. Regard Tdisc as a subgroup of 
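$\mathbb{T}$ by mapping $(x_1, \dots, x_d) \in \mathbb{T}_{\mathrm{disc}}$ to $(x_1/q_1, \dots, x_d/q_d) \in \mathbb{T}$. Set

$$A'_{\mathrm{disc}} = A' \cap (\mathbb{Z}/Q\mathbb{Z} \times [0, 1] \times \mathbb{T}_{\mathrm{disc}}).$$

Then as $q_1, \dots, q_d \to \infty$ both $\int 1_{A'_{\mathrm{disc}}} (w \times 1_{\mathbb{T}_{\mathrm{disc}}}) \, d\mu \to \int 1_{A'} (w \times 1_{\mathbb{T}}) \, d\mu$ and
$T(A'_{\mathrm{disc}}) \to T(A')$, so by the previous claim $T(A) \geq T(A') \gg_\varepsilon 1$.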

Finally, we are ready to verify Proposition 3.1 itself, which differs from Claim II
only in the presence of a general continuous function $\Psi : \mathbb{Z}/q\mathbb{Z} \times [0, 1] \times \mathbb{T} \to [0, 1]$
in place of the characteristic function I a- Suppose that \& is given and J W • (mj x 
It) dfi > | + e. Consider the open set A = {x : &(x) > \e) C TLjqL x [0, 1] x T.
Since 

\ + £ ^ J * • (w x 1 T ) = y * • (iu x 1 T ) + J * • (w x 1 T ) < y U(w x 1 t ) + ^e, 

we have / U(w x It) ^ | + | £ - By Claim II we therefore have T(A) ^> £ 1, and 
thus T(*) > (ie) 3 T(A) > e 1. This (at last!) completes the proof of Proposition 
13.11 and hence of Theorem 11.11 □ 



6. More on sets of doubling less than 4 

The purpose of this section is to expand just a little more on the results of
Section 4, which may be of independent interest. The first theorem below is a
direct consequence of Theorem 4.1; the second is the corresponding consequence of
Corollary 4.2 in the case $Q = 1$.

Theorem 6.1. If $A \subset \{1, \dots, N\}$ is a set such that $|A - A| \leq 4|A| - \varepsilon N$ then
there is an arithmetic progression $P \subset \{1, \dots, N\}$ of length $\gg_\varepsilon N$ on which $A$ has
density at least \ + 

Theorem 6.2. If $A \subset [0, 1]$ is an open set such that $|A - A| \leq 4|A| - \varepsilon$ then there
is an interval I C [0, 1] of length 3> e 1 on which A has density at least | + ^e. 

Remarks. 

(i) Neither the constant \ in Theorem 16. II for Theorem 14. ip nor the constant 
j in Theorem 16.21 is optimal. Indeed, if one allows the implied constants 
to depend on rj then our proof can be modified to get j — rj for both these 
constants. 

(ii) A similar conclusion to Theorem 6.2 could be obtained if one instead had
two sets $A$ and $B$ satisfying $|A - B| \leq |A| + |B| + 2|A|^{1/2}|B|^{1/2} - \varepsilon$. The
conclusion would then be that there are intervals $I_A$ and $I_B$ such that
the densities of A, B on I a, Ib respectively sum to at least 1 + |e (or up 
to 1 + (5 — f])e, constants depending on if). There would be a similar 
generalisation of Theorem 6.1. We leave the proof of these results as an
exercise to the interested reader. One annoying additional complication
would be the need to have an arithmetic regularity lemma valid for two
sets simultaneously. While such a statement can be easily established by
modifying the arguments of [GT10], no such result currently appears in
the literature.

(iii) We are not aware of any reason that the length of $I$ or $P$ could not be
bounded below by some quite reasonable function of $\varepsilon$, but our argument
does not give one. Much better quantitative results are available under
the assumption that $|A - A| < 3|A|$: see [Ruz91].

Consider the discrete case $A \subset \{1, \dots, N\}$. Note that the hypothesis $|A - A| \leq
4|A| - \varepsilon N$ implies $|A| \geq \tfrac{1}{4}\varepsilon N$. Thus one can consider Theorem 6.1 to contain a
"hidden hypothesis" to the effect that $A$ is somewhat dense in $\{1, \dots, N\}$. If instead
one assumed only that $|A - A| \leq (4 - \varepsilon)|A|$, then our argument would give bounds
depending on $\alpha = |A|/N$ as well as $\varepsilon$. Using a "Freiman modelling" argument of a
type pioneered by Ruzsa, however, we can overcome this. We first isolate a lemma
due to Lev [Lev97] (though earlier results of Sárközy [Sar89] would also suffice).

Lemma 6.3. Let $P$ be a finite arithmetic progression of length greater than 12,
and let $X \subset P$ be a set with $|X| > \tfrac{1}{2}|P|$. Then $5X - 4X$ contains $P$.

Proof. The statement of the lemma being affine-invariant, we may suppose without
loss of generality that $P = \{1, \dots, N\}$. Since $|X| > N/2$, the highest common
factor of the elements of $X$ is 1. By [Lev97, Lemma 1] with $k = 2$ (and a short
computation), AX contains an interval of length at least — 3) > N. It follows 
that $4X - 4X$ contains $\{-N, \dots, N\}$, and the result follows immediately. □

In the proof of the next lemma we will use the notion of a Freiman homomorphism.
See [TV10, Definition 5.21] for details.

Theorem 6.4. Let $\varepsilon > 0$. Suppose that $A$ is a finite set of integers such that
$|A - A| \leq (4 - \varepsilon)|A|$. Then there is an arithmetic progression $P \subset \mathbb{Z}$ of length
$\gg_\varepsilon |A|$ on which the density of $A$ is at least $\tfrac{1}{2} + c\varepsilon$.

Proof. By [GR06, Theorem 1.4], every set $A \subset \mathbb{Z}$ with $|A - A| \leq 4|A|$ is Freiman
18-isomorphic to a subset of $\{1, \dots, N\}$ for some $N \ll |A|$. Let $\pi : A \to A' \subset
\{1, \dots, N\}$ be this Freiman isomorphism. Then clearly $|A'|/N \gg 1$, so $|A' - A'| \leq
4|A'| - \varepsilon' N$ with $\varepsilon' \gg \varepsilon$. By Theorem 6.1 applied to $A'$, there is a progression $P' \subset$
{1, ...,N} of length \P'\ > e N on which the density of A' is at least | + -|e'. 
Finally, by the preceding lemma, $P' \subset 5A' - 4A'$, so it follows from basic facts about
Freiman homomorphisms (see [TV10, Section 5.2] for example) that $\pi^{-1} : A' \to A$
induces a Freiman 2-homomorphism $P' \to \mathbb{Z}$ coinciding with $\pi^{-1}$ on $A' \cap P'$.
The image $P$ of $P'$ under this map is then a progression of length $\gg_\varepsilon |A|$ on which $A$ has
density at least $\tfrac{1}{2} + c\varepsilon$. □

Remarks. 

(i) The value for c given by this argument is something like 2~ . 

(ii) Under stronger conditions such as $|A + A| < 3|A|$ or $|A - A| < 3|A|$, more
precise information can be obtained: see [Fre73, Theorem 1.9] or [LS95].

(iii) Statements of the same form as Theorem 6.4, but with $\tfrac{1}{2} + c\varepsilon$ replaced by
some small quantity $f(\varepsilon) > 0$, follow from versions of Freiman's theorem
[Fre73, Theorem 2.8], [Bil99, Theorem 1.2]. These statements come with
more effective lower bounds on the length of $P$.

Appendix A. Regularity and counting lemmata 

In this appendix we collect some tools used in the main part of the paper. All 
of these are more or less standard, or at least have easily quotable references. 

The arithmetic regularity lemma. We begin with the arithmetic regularity lemma,
the main result of [GT10], used twice in the paper. As reassurance to the reader
who views that paper with trepidation, we remark that the majority of it is given to
applications, and only Sections 1 and 2 are relevant to us. Furthermore, that paper
establishes a regularity lemma for the Gowers $U^{s+1}$-norm for general $s$, whereas
we only need the case $s = 1$. This means that the notion of a nilsequence, beyond
the abelian case, is not relevant here. A complete, self-contained proof of the
arithmetic regularity lemma in the form we need it here can be written up in less than
10 pages. The first-named author has provided such a write-up online [Ebe].

We begin by defining a quantitative notion of irrationality for vectors $\theta \in \mathbb{R}^d$.

Definition A.1. Suppose that $\theta \in \mathbb{R}^d$. Let $N \geq 1$ be an integer and let $A > 0$ be
some real parameter. We say that $\theta$ is $(A, N)$-irrational if whenever $q_1, \dots, q_d$ are
integers, not all zero, with $\sum_i |q_i| \leq A$ we have $\|q_1\theta_1 + \cdots + q_d\theta_d\|_{\mathbb{R}/\mathbb{Z}} \geq A/N$.
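For small $d$ and $A$, Definition A.1 can be checked directly by enumerating the finitely many admissible coefficient vectors. The following sketch is purely illustrative; the function names are hypothetical, and floating-point arithmetic is assumed to be accurate enough for the small examples considered.

```python
import itertools
import math

def dist_to_integers(x):
    """||x||_{R/Z}: distance from x to the nearest integer."""
    return abs(x - round(x))

def is_irrational(theta, A, N):
    """Brute-force test of (A, N)-irrationality for theta in R^d:
    every nonzero integer vector q with sum |q_i| <= A must satisfy
    ||q . theta||_{R/Z} >= A / N."""
    bound = math.floor(A)
    for q in itertools.product(range(-bound, bound + 1), repeat=len(theta)):
        if all(c == 0 for c in q) or sum(abs(c) for c in q) > A:
            continue
        if dist_to_integers(sum(c * t for c, t in zip(q, theta))) < A / N:
            return False
    return True

# Hypothetical example: the golden-ratio rotation number.
golden = (math.sqrt(5.0) - 1.0) / 2.0
```

For instance $\theta = 1/2$ is $(1, 10)$-irrational but not $(2, 10)$-irrational, because $2\theta$ is an integer.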

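Lemma A.2. Suppose we are given a parameter $\delta > 0$ and a growth function
$\mathcal{F} : \mathbb{N} \to \mathbb{R}_+$. Then for any function $f : \{1, \dots, N\} \to [0, 1]$ there is an $M \ll_{\delta,\mathcal{F}} 1$
and a decomposition $f = f_{\mathrm{tor}} + f_{\mathrm{sml}} + f_{\mathrm{unf}}$ into functions taking values in $[-1,1]$,
where $\|f_{\mathrm{sml}}\|_{\ell^2(N)} \leq \delta$, $\|f_{\mathrm{unf}}\|_{U^2(N)} \leq 1/\mathcal{F}(M)$ and $f_{\mathrm{tor}}(n) = F(n \ (\mathrm{mod}\ q), n/N, \theta n)$
for some $q, d \leq M$ and some function $F : \mathbb{Z}/q\mathbb{Z} \times [0,1] \times (\mathbb{R}/\mathbb{Z})^d \to [0,1]$ with
Lipschitz constant at most $M$. Furthermore the element $\theta \in (\mathbb{R}/\mathbb{Z})^d$ may be taken
to be $(\mathcal{F}(M), N)$-irrational.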

Here $\|\cdot\|_{U^2(N)}$ is the Gowers $U^2(N)$-norm, whose definition will be recalled below.
We do not offer a proof of this lemma, but merely a guide to extracting this
result from [GT10]. The function $f_{\mathrm{tor}}$ written here is the same thing as, in the
language of that paper, a "$(\mathcal{F}(M), N)$-irrational virtual nilsequence of degree $\leq 1$,
complexity $\leq M$ and scale $N$". Once we have justified this assertion, Lemma A.2
is essentially the same as [GT10, Theorem 1.2], the proof of which occupies Section
2 of that paper. The definition of an irrational virtual nilsequence of degree $\leq s$ is
rather long and complicated for general $s$, but for $s = 1$ a great deal simplifies: a
filtered nilmanifold of degree 1 and complexity $\leq M$ (cf. [GT10, Definition 1.4]) is
just the torus $(\mathbb{R}/\mathbb{Z})^d$ for $d \leq M$, a polynomial orbit of degree 1 (cf. [GT10, Definition
1.7]) is just a sequence of the form $n \mapsto \theta n$ for some $\theta \in (\mathbb{R}/\mathbb{Z})^d$ and a virtual
nilsequence of degree 1 and complexity $\leq M$ at scale $N$ (cf. [GT10, Definition
1.9]) is just a function $F(n \ (\mathrm{mod}\ q), n/N, \theta n)$ with $\theta \in (\mathbb{R}/\mathbb{Z})^d$, $d, q, \|F\|_{\mathrm{Lip}} \leq M$.
Finally, an $(A, N)$-irrational sequence (cf. [GT10, Definition A.6]), in the case that
the sequence has the form $n \mapsto \theta n$ where $\theta \in (\mathbb{R}/\mathbb{Z})^d$, coincides with the notion of
irrationality defined above.

Equidistribution and counting. If $\theta \in (\mathbb{R}/\mathbb{Z})^d$ is highly irrational in the sense
of Definition A.1 then the sequence $\theta n$ is highly equidistributed on $(\mathbb{R}/\mathbb{Z})^d$ as $n$
ranges over fairly long progressions. Moreover, the triple $(n \ (\mathrm{mod}\ q), n/N, n\theta)$ is
highly equidistributed in $\mathbb{Z}/q\mathbb{Z} \times [0, 1] \times (\mathbb{R}/\mathbb{Z})^d$ as $n$ varies over $\{1, \dots, N\}$. We prove
various statements of this type required in the main body of the paper. The first
is quite classical.



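Lemma A.3. Suppose that $\theta \in (\mathbb{R}/\mathbb{Z})^d$ is $(A, N)$-irrational, and let $F : (\mathbb{R}/\mathbb{Z})^d \to
\mathbb{C}$ be a function with Lipschitz constant at most $M$. Suppose that $P \subset \{1, \dots, N\}$
is an arithmetic progression of length at least $\eta N$. Then, provided that $A >
A_0(M, d, \eta, \delta)$ is large enough,

$$\Big| \mathbb{E}_{n \in P} F(\theta n) - \int F \, d\mu \Big| \leq \delta.$$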

Proof. The key here (as usual in equidistribution theory) is to take a Fourier
expansion of $F$ and truncate it. In particular, we may find $M_0 = O_{M,d,\delta}(1)$ and
coefficients $c_m$ with $c_0 = \int F$ and $c_m = O_{M,d}(1)$ for $m \neq 0$ such that

$$\Big| F(x) - \sum_{|m| \leq M_0} c_m e(m \cdot x) \Big| \leq \frac{\delta}{2}$$

uniformly in $x$. For a proof, see for example [GT08, Lemma A.9]. It follows, of
course, that

$$\Big| \mathbb{E}_{n \in P} F(\theta n) - \int F \, d\mu \Big| \leq \sum_{|m| \leq M_0,\, m \neq 0} |c_m| \, |\mathbb{E}_{n \in P} e(m \cdot \theta n)| + \frac{\delta}{2}.$$

Thus we need only show that

$$\mathbb{E}_{n \in P} e(m \cdot \theta n) = o_{m,\eta;\, A \to \infty}(1),$$

and then take $A$ sufficiently large. If the common difference of the arithmetic
progression $P$ is $h$, then by summing the geometric progression we have the bound

$$\mathbb{E}_{n \in P} e(m \cdot \theta n) \ll \frac{1}{\eta N \|(m \cdot \theta) h\|_{\mathbb{R}/\mathbb{Z}}}.$$

If $A > |h|(|m_1| + \cdots + |m_d|)$ (where $m = (m_1, \dots, m_d)$) then, by the definition of
$(A, N)$-irrationality, $\|(m \cdot \theta)h\|_{\mathbb{R}/\mathbb{Z}} \geq A/N$. The result follows immediately. □
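The geometric-progression bound used in this proof can be illustrated numerically. The sketch below uses the explicit form $|\mathbb{E}_{n \in P} e(\beta n)| \leq 1/(2|P| \, \|\beta h\|_{\mathbb{R}/\mathbb{Z}})$, which follows from $|\sin(\pi x)| \geq 2\|x\|_{\mathbb{R}/\mathbb{Z}}$; the function names and the choice of $\beta$ are illustrative assumptions, not taken from the text.

```python
import cmath
import math

def e(x):
    """e(x) = exp(2 pi i x)."""
    return cmath.exp(2j * math.pi * x)

def dist_to_integers(x):
    """||x||_{R/Z}: distance from x to the nearest integer."""
    return abs(x - round(x))

def progression_average(beta, start, step, length):
    """E_{n in P} e(beta n) for P = {start, start + step, ..., start + (length-1) step}."""
    return sum(e(beta * (start + k * step)) for k in range(length)) / length

def geometric_bound(beta, step, length):
    """The bound 1 / (2 |P| ||beta * step||_{R/Z}); requires beta * step not an integer."""
    return 1.0 / (2.0 * length * dist_to_integers(beta * step))

# Hypothetical choice of frequency: the golden-ratio rotation number.
beta = (math.sqrt(5.0) - 1.0) / 2.0
checks = [(abs(progression_average(beta, 1, h, L)), geometric_bound(beta, h, L))
          for h in range(1, 6) for L in (10, 100, 1000)]
```

In the lemma, $\beta = m \cdot \theta$ and irrationality supplies the lower bound on $\|\beta h\|_{\mathbb{R}/\mathbb{Z}}$ that makes the right-hand side small.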

Our second result is a little more involved, but is proved in essentially the same 
way as the last lemma. It states that if 9 is highly irrational and N is sufficiently 
large then (n (mod q), n/N, 9n) is highly equidistributed in Z/qZ x [0, 1] x (R/Z) d . 



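Lemma A.4. Suppose that $\theta \in (\mathbb{R}/\mathbb{Z})^d$ is $(A, N)$-irrational. Let $q \in \mathbb{N}$, and let
$F : \mathbb{Z}/q\mathbb{Z} \times [0, 1] \times (\mathbb{R}/\mathbb{Z})^d \to \mathbb{C}$ be a function with Lipschitz constant at most $M$. Let
$\delta > 0$ be arbitrary. Then, provided that $A > A_0(M, q, d, \delta)$ and $N > N_0(M, q, d, \delta)$
are large enough,

$$\Big| \mathbb{E}_{n \leq N} F(n \ (\mathrm{mod}\ q), n/N, \theta n) - \int F \, d\mu \Big| \leq \delta.$$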



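Proof sketch. Again the idea is to take a truncated Fourier expansion of $F$, but because
$F|_{\mathbb{Z}/q\mathbb{Z} \times \{0\} \times (\mathbb{R}/\mathbb{Z})^d}$ and $F|_{\mathbb{Z}/q\mathbb{Z} \times \{1\} \times (\mathbb{R}/\mathbb{Z})^d}$ need not agree the expansion looks
a little more complicated. The key point is that $F$ can be extended to an $M$-Lipschitz
function $\mathbb{Z}/q\mathbb{Z} \times [-1, 1] \times (\mathbb{R}/\mathbb{Z})^d \to \mathbb{C}$ such that $F(x, y, z) = F(x, -y, z)$,
so $F$ may be approximated by a sum of the functions $\phi_{a,m,\mathbf{m}}$ given by

$$\phi_{a,m,\mathbf{m}}(x, y, z) = e\Big(\frac{a}{q}x + \frac{m}{2}y + \mathbf{m} \cdot z\Big) + e\Big(\frac{a}{q}x - \frac{m}{2}y + \mathbf{m} \cdot z\Big), \qquad (A.1)$$

where $a \in \mathbb{Z}/q\mathbb{Z}$, $m \in \mathbb{Z}$ and $\mathbf{m} \in \mathbb{Z}^d$. Then just as in the proof of the previous
lemma we need only check that

$$\mathbb{E}_{n \leq N} \phi_{a,m,\mathbf{m}}(n \ (\mathrm{mod}\ q), n/N, \theta n) = o(1) \qquad (A.2)$$

provided that $a, m, \mathbf{m}$ are not all zero. Substituting in, the left-hand side is

$$\mathbb{E}_{n \leq N} \Big[ e\Big(\Big(\frac{a}{q} + \frac{m}{2N} + \mathbf{m} \cdot \theta\Big) n\Big) + e\Big(\Big(\frac{a}{q} - \frac{m}{2N} + \mathbf{m} \cdot \theta\Big) n\Big) \Big]. \qquad (A.3)$$

Summing the geometric progressions, we see that this is bounded by $\varepsilon$ unless

$$\Big\| \frac{a}{q} \pm \frac{m}{2N} + \mathbf{m} \cdot \theta \Big\|_{\mathbb{R}/\mathbb{Z}} \leq \frac{1}{N\varepsilon}. \qquad (A.4)$$

Supposing first that $\mathbf{m} \neq 0$, inequality (A.4) implies

$$\Big\| \pm\frac{mq}{2N} + q\mathbf{m} \cdot \theta \Big\|_{\mathbb{R}/\mathbb{Z}} \leq \frac{q}{N\varepsilon},$$

and hence

$$\|\mathbf{m}' \cdot \theta\|_{\mathbb{R}/\mathbb{Z}} \ll \frac{|m| q}{N} + \frac{q}{N\varepsilon},$$

where $\mathbf{m}' = q\mathbf{m}$. If $A$ is sufficiently large in terms of $\varepsilon$, $q$, $m$ and $\mathbf{m}$, this is contrary
to the $(A, N)$-irrationality of $\theta$.

Now suppose that $\mathbf{m} = 0$. Then if $N$ is large enough depending on $m$ and $q$,
(A.4) implies that $a = 0$. Thus $a = \mathbf{m} = 0$, and hence $m \neq 0$. Then the expression
(A.3) is $\mathbb{E}_{n \leq N}(e(mn/2N) + e(-mn/2N)) = O(1/N)$, so (A.2) certainly follows in this
case as well. □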

The next result is a kind of "counting lemma" . It eventually allows one to relate 
summing triples (x,y,x + y) in A C {1, . . . ,N} with summing triples in (K/Z) d 
weighted by / tor . 

Lemma A.5. Suppose that $\theta \in (\mathbb{R}/\mathbb{Z})^d$ is $(A, N)$-irrational. Let $q \in \mathbb{N}$, and let
$F : \mathbb{Z}/q\mathbb{Z} \times [0, 1] \times (\mathbb{R}/\mathbb{Z})^d \to \mathbb{C}$ be a function with Lipschitz constant at most $M$.
Let $f(n) = F(n \ (\mathrm{mod}\ q), n/N, \theta n)$. If $A > A_0(q, M, \varepsilon)$ and $N > N_0(q, M, \varepsilon)$ are
large enough then

$$|T(f) - T(F)| = |T_{\{1,\dots,N\}}(f) - T_{\mathbb{Z}/q\mathbb{Z} \times [0,1] \times (\mathbb{R}/\mathbb{Z})^d}(F)| \leq \varepsilon.$$

Proof sketch. Write $\pi(n) = (n \ (\mathrm{mod}\ q), n/N, \theta n)$. We showed in Lemma A.4 that,
if $\theta$ is highly irrational, $\pi(n)$ is highly equidistributed in $X = \mathbb{Z}/q\mathbb{Z} \times [0, 1] \times (\mathbb{R}/\mathbb{Z})^d$
as $n$ ranges in $\{1, \dots, N\}$. It follows that as $n, n'$ range over $\{1, \dots, N\}$, the pair
$(\pi(n), \pi(n'))$ is highly equidistributed in $X \times X$, and in particular

$$\mathbb{E}_{n,n' \in \{1,\dots,N\}} F_*(\pi(n), \pi(n')) \approx \int F_*(x, x') \, d\mu(x) \, d\mu(x')$$

for any Lipschitz function $F_* : X \times X \to \mathbb{C}$. Applying this with $F_*(x, x') =
F(x)F(x')F(x + x')$ gives the stated result. □

Finally, we require the following simple result which does not mention $\theta$ at all.

Lemma A.6. Let $q \in \mathbb{N}$. Suppose that $w : \mathbb{Z}/q\mathbb{Z} \times [0, 1] \to \mathbb{C}$ has Lipschitz constant
at most $M$. Then

$$\mathbb{E}_{n \leq N} w(n \ (\mathrm{mod}\ q), n/N) = \int w \, d\mu + o_{q,M;\, N \to \infty}(1).$$

Proof sketch. Split into progressions $P_a = \{n \leq N : n \equiv a \ (\mathrm{mod}\ q)\}$. Then one
need only show that

$$\mathbb{E}_{n \in P_a} w(a, n/N) = \int_0^1 w(a, y) \, dy + o_{M;\, N \to \infty}(1),$$

which is fairly obvious from the definition of the Riemann integral. □
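Lemma A.6 is a Riemann-sum statement, and a quick numerical illustration is straightforward. In the sketch below the weight `w` is a hypothetical Lipschitz function chosen only because its integral is easy to write down exactly; it is not the weight constructed in Section 5.

```python
import math

def w(a, y, q):
    """A hypothetical Lipschitz weight on Z/qZ x [0,1], for illustration only."""
    return (1.0 + math.cos(2.0 * math.pi * a / q)) * y

def discrete_average(q, N):
    """E_{n <= N} w(n mod q, n/N)."""
    return sum(w(n % q, n / N, q) for n in range(1, N + 1)) / N

def exact_integral(q):
    """Integral of w against mu (uniform on Z/qZ times Lebesgue on [0,1]):
    the y-integral of y is 1/2, averaged over residues a."""
    return sum(1.0 + math.cos(2.0 * math.pi * a / q) for a in range(q)) / q * 0.5

error_small_N = abs(discrete_average(4, 100) - exact_integral(4))
error_large_N = abs(discrete_average(4, 100000) - exact_integral(4))
```

The error decays as $N$ grows, in line with the $o_{q,M;\,N \to \infty}(1)$ term in the lemma.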

Properties of the Gowers $U^2$-norm. The statement of the arithmetic regularity
lemma involved the Gowers $U^2(N)$-norm of a function. Here we recall some basic
properties of this norm, whose proofs may be found in several places. We begin by
recalling the definition. For a fuller discussion, see [GT10].

Definition A.7. Let $f : \{1, \dots, N\} \to \mathbb{C}$ be a function. Then we define $\|f\|_{U^2(N)} =
\|f\|_{U^2(G)} / \|1_{\{1,\dots,N\}}\|_{U^2(G)}$, where $G = \mathbb{Z}/N'\mathbb{Z}$ for some arbitrary $N' \geq 4N$ and

$$\|f\|_{U^2(G)}^4 = \mathbb{E}_{x,h_1,h_2 \in G} f(x) \overline{f(x + h_1)} \, \overline{f(x + h_2)} f(x + h_1 + h_2).$$

In this definition, $f$ is regarded (by abuse of notation) as a function on $G$ by
defining $f(x) = 0$ if $x \in G \setminus \{1, \dots, N\}$, where $\{1, \dots, N\}$ is regarded as embedded
in $G$ in the natural manner. It is not hard to see that this definition does not
depend on the exact choice of $N'$. Introducing the group $G$ is a technical device
which is useful in several parts of the theory.

We begin with a standard lemma. 

Lemma A.8. Suppose that $f : \{1, \dots, N\} \to \mathbb{C}$ is a function. Then $|\mathbb{E}_{n \leq N} f(n)| \ll
\|f\|_{U^2(N)}$. More generally suppose that $P \subset \{1, \dots, N\}$ is a progression of length
at least $\eta N$. Then $|\mathbb{E}_{n \in P} f(n)| \ll \eta^{-1} \|f\|_{U^2(N)}$.



Proof. We establish the second statement, the first being a special case of it. Fix
a prime $N' \in [4N, 8N]$ and write $G = \mathbb{Z}/N'\mathbb{Z}$ as in the definition of the Gowers
$U^2(N)$-norm. Note that the $U^2(N)$-norm and the $U^2(G)$-norm are comparable up
to an absolute constant. We use the inequality

$$|\mathbb{E}_{x \in G} f(x) g(x)| \leq \|\hat{f}\|_4 \|\hat{g}\|_{4/3} = \|f\|_{U^2(G)} \Big( \sum_r |\hat{g}(r)|^{4/3} \Big)^{3/4}.$$

Here the Fourier transform $\hat{f}(r) = \mathbb{E}_{x \in G} f(x) e(-rx/N')$ is the discrete Fourier
transform on $G$, and we have used Hölder's inequality and the well-known fact
(see, for example, [TV10, Chapter 11]) that $\|f\|_{U^2(G)} = \|\hat{f}\|_4$. Taking $g = 1_P$, the
characteristic function of the progression $P$, it suffices to show that $\sum_r |\hat{g}(r)|^{4/3} =
O(1)$. Dilating, we may assume that the common difference of $P$ is 1. But then we
have, upon summing the geometric progression, the bound $|\hat{g}(r)| \ll \min(1, |r|^{-1})$,
from which the result follows immediately. □
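The identity $\|f\|_{U^2(G)} = \|\hat{f}\|_4$ quoted in this proof is easy to confirm numerically on a small cyclic group. The following is a minimal sketch with hypothetical function names, using the normalisation $\hat{f}(r) = \mathbb{E}_{x \in G} f(x) e(-rx/|G|)$ stated above and the standard definition of the $U^2(G)$-norm with complex conjugates on the two middle factors.

```python
import cmath
import math
import random

def fourier(f):
    """Normalised discrete Fourier transform on Z/nZ: f^(r) = E_x f(x) e(-r x / n)."""
    n = len(f)
    return [sum(f[x] * cmath.exp(-2j * math.pi * r * x / n) for x in range(n)) / n
            for r in range(n)]

def u2_fourth_power(f):
    """||f||_{U^2(G)}^4 = E_{x,h1,h2} f(x) conj f(x+h1) conj f(x+h2) f(x+h1+h2)."""
    n = len(f)
    total = 0j
    for x in range(n):
        for h1 in range(n):
            for h2 in range(n):
                total += (f[x] * f[(x + h1) % n].conjugate()
                          * f[(x + h2) % n].conjugate() * f[(x + h1 + h2) % n])
    return total / n ** 3

# Hypothetical test function on Z/7Z with random complex values.
random.seed(2)
f = [complex(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(7)]
lhs = u2_fourth_power(f)
rhs = sum(abs(c) ** 4 for c in fourier(f))
```

Both sides agree up to rounding error, and the left-hand side is real and non-negative, as a fourth power of a norm should be.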



The next lemma is a more complicated result along similar lines.

Lemma A.9. Suppose that $d, q, M \in \mathbb{N}$ and that $\delta > 0$. Then for some $\delta_* =
\delta_*(d, q, M, \delta) > 0$ and all sufficiently large $N \geq N_0(d, q, M, \delta)$ the following is true.
Let $f(n) = F(n \ (\mathrm{mod}\ q), n/N, \theta n)$, where $F : \mathbb{Z}/q\mathbb{Z} \times [0, 1] \times (\mathbb{R}/\mathbb{Z})^d \to [0, 1]$ is
$M$-Lipschitz and $\theta \in (\mathbb{R}/\mathbb{Z})^d$, and suppose $g : \{1, \dots, N\} \to [-1, 1]$ satisfies $\|g\|_{U^2(N)} \leq
\delta_*$. Then $|\mathbb{E}_{n \leq N} f(n) g(n)| \leq \delta$.

In other words, the "structured objects" and the "pseudorandom objects" of the 
regularity lemma do not correlate. 

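Proof sketch. As in the proof of Lemma A.4, the idea is to decompose $F$ as a Fourier
expansion of length $O_{\delta,d,q,M}(1)$, plus a uniformly small error:

$$F = \sum_{a,m,\mathbf{m}} c_{a,m,\mathbf{m}} \phi_{a,m,\mathbf{m}} + F_{\mathrm{sml}},$$

where $\phi_{a,m,\mathbf{m}}$ is given by (A.1), $|c_{a,m,\mathbf{m}}| \leq 1$ and $\|F_{\mathrm{sml}}\|_\infty \leq \tfrac{1}{2}\delta$. Note

$$\phi_{a,m,\mathbf{m}}(n \ (\mathrm{mod}\ q), n/N, \theta n) = e(\beta_+ n) + e(\beta_- n), \qquad (A.5)$$

where $\beta_\pm = \frac{a}{q} \pm \frac{m}{2N} + \mathbf{m} \cdot \theta$ (though this exact form is unimportant). However, writing
$e(\beta n) = e(\beta(n + h_1 + h_2)) e(-\beta h_1) e(-\beta h_2)$ and averaging over $h_1, h_2$ it follows by the
Gowers-Cauchy-Schwarz inequality [TV10, Chapter 11] that $|\mathbb{E}_{n \leq N} e(\beta n) g(n)| \ll
\|g\|_{U^2(N)}$ uniformly in $\beta$, so

$$|\mathbb{E}_{n \leq N} \phi_{a,m,\mathbf{m}}(n \ (\mathrm{mod}\ q), n/N, \theta n) g(n)| \ll \delta_*$$

uniformly in $a, m, \mathbf{m}$. It follows that

$$|\mathbb{E}_{n \leq N} f(n) g(n)| = |\mathbb{E}_{n \leq N} F(n \ (\mathrm{mod}\ q), n/N, \theta n) g(n)|$$

$$\leq \sum_{a,m,\mathbf{m}} |\mathbb{E}_{n \leq N} \phi_{a,m,\mathbf{m}}(n \ (\mathrm{mod}\ q), n/N, \theta n) g(n)| + \tfrac{1}{2}\delta$$

$$\leq O_{\delta,d,q,M}(\delta_*) + \delta/2,$$

so for $\delta_*$ sufficiently small depending on $\delta$, $d$, $q$ and $M$, $|\mathbb{E}_{n \leq N} f(n) g(n)| \leq \delta$. □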

Miscellany. We turn now to some rather miscellaneous lemmas. Recall that if
$f : \{1, \dots, N\} \to \mathbb{C}$ is a function then $T(f) = \mathbb{E}_{n,n' \leq N} f(n) f(n') f(n + n')$.

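Lemma A.10. Suppose that $f, \tilde{f} : \{1, \dots, N\} \to [-1, 1]$ are functions. Then
$|T(f) - T(\tilde{f})| \leq 7\|f - \tilde{f}\|_{\ell^1(N)}$ and $|T(f) - T(\tilde{f})| \ll \|f - \tilde{f}\|_{U^2(N)}$.

Recalling that $\|\cdot\|_{\ell^1(N)} \leq \|\cdot\|_{\ell^2(N)} \leq \|\cdot\|_\infty$, we also have $|T(f) - T(\tilde{f})| \leq
7\|f - \tilde{f}\|_{\ell^2(N)}$ and $|T(f) - T(\tilde{f})| \leq 7\|f - \tilde{f}\|_\infty$.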

Proof. Write $g = f - \tilde{f}$. Writing $f = \tilde{f} + g$, $T(f)$ may be expanded as a sum of
8 terms, one of which is $T(\tilde{f})$, the other 7 of which are trilinear terms of the form
$\mathbb{E}_{n,n'} f_1(n) f_2(n') f_3(n + n')$ with at least one of the $f_i$ being equal to $g$. Using the
hypothesis that all the $f_i$'s are bounded by 1, the estimate

$$|\mathbb{E}_{n,n'} f_1(n) f_2(n') f_3(n + n')| \leq \|f_i\|_{\ell^1(N)}$$

is an easy consequence of the triangle inequality.

The case of the Gowers $U^2(N)$-norm is dealt with in a similar way, using instead
the bound

$$|\mathbb{E}_{n,n'} f_1(n) f_2(n') f_3(n + n')| \ll \|f_i\|_{U^2(N)}.$$

This is an instance of a generalised von Neumann theorem, for which there are many
references including [TV10, Lemma 11.4]. □
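The $\ell^1$ estimate of Lemma A.10 can be tested by brute force for small $N$. The sketch below makes two labelled assumptions: $f(n + n')$ is taken to be 0 when $n + n' > N$, and $\|\cdot\|_{\ell^1(N)}$ is the normalised norm $\mathbb{E}_{n \leq N} |\cdot|$. The function names are hypothetical.

```python
import random

def T(f):
    """T(f) = E_{n,n' <= N} f(n) f(n') f(n+n'), where f is a list of length N
    with f[k-1] the value at k, and f(m) is assumed to be 0 for m > N."""
    N = len(f)
    total = 0.0
    for n in range(1, N + 1):
        for m in range(1, N - n + 1):  # only pairs with n + m <= N contribute
            total += f[n - 1] * f[m - 1] * f[n + m - 1]
    return total / N ** 2

def l1_norm(f):
    """Normalised l^1(N)-norm: E_{n <= N} |f(n)|."""
    return sum(abs(v) for v in f) / len(f)

# Hypothetical random test functions with values in [-1, 1].
random.seed(3)
results = []
for _ in range(100):
    N = random.randint(1, 30)
    f = [random.uniform(-1, 1) for _ in range(N)]
    f_tilde = [random.uniform(-1, 1) for _ in range(N)]
    diff = [u - v for u, v in zip(f, f_tilde)]
    results.append((abs(T(f) - T(f_tilde)), 7 * l1_norm(diff)))
```

In practice the observed differences sit well below the bound, since the constant 7 simply counts the trilinear terms in the expansion.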

Lemma A.11. Let $X$ be a compact metric abelian group endowed with a translation-invariant
metric $d$ and a translation-invariant probability measure $\mu$. Suppose that
$f : X \to \mathbb{C}$ is a function with $\|f\|_{\mathrm{Lip}} \leq K$, thus $|f(x) - f(x')| \leq K d(x, x')$. Let
$g : X \to \mathbb{C}$ be any continuous function with $\|g\|_\infty \leq 1$. Then the convolution
$f * g(x) = \int f(y) g(x - y) \, d\mu(y)$ also has Lipschitz constant at most $K$.

Proof. We have

$$|f * g(x) - f * g(x')| = \Big| \int (f(x - y) - f(x' - y)) g(y) \, d\mu(y) \Big|$$

$$\leq \int |f(x - y) - f(x' - y)| \, d\mu(y)$$

$$\leq K \sup_y d(x - y, x' - y) = K d(x, x'). \qquad \square$$

The next lemma is an instance of Young's inequality, but we include the (short) 
proof for ease of reference. 

Lemma A.12. Let $P, P' \subset \{1, \dots, N\}$ be arithmetic progressions with the same
length. Let $f : P \to \mathbb{C}$ and $g : P' \to \mathbb{C}$ be two functions. Suppose that both are
bounded pointwise by 1 and that either $\mathbb{E}_{n \in P} |f(n)| \leq \eta$ or $\mathbb{E}_{n \in P'} |g(n)| \leq \eta$. Write
f * 9(n) 4E m f(m)g(n - m). Then \\f * < n\P\/N. 

Proof. Suppose that E ng p|/(n)| ^ rj. Then we have 



\f*9(n)\ = 



^2f(m)g(n - m) 



The case E n£ p\g(n)\ ^ n is similar. □ 

Lemma A. 13. Let f,f,g : {1,...,N} — > [—1,1] be functions such that \\f — 

f\\u 2 (N) *S S. Then \ f * g(d) — f * g(d)\ ^ for all except at most 40(5 N values 
ofd. 
Proof. The functions $f$ and $g$ may be regarded as functions on $G = \mathbb{Z}/N'\mathbb{Z}$, where
$N' = 4N$, in a natural way. Let $h = f - \tilde{f}$. Then

E xeG \E yeG h(y)g(x - y)\ 2 = ]T |Mr)| 2 |g(r)| 2 

r 

= ll^ll^(G)ll5ll^( G )^ll^ll^(G)^105 2 . 

Once again we have used basic facts about the $U^2(G)$-norm and the discrete Fourier
transform as may be found in [TV10, Section 4.2]. Replacing the expectations over
$G$ by sums we obtain

J2\ h *9( x )\ 2 < 64(« 2 7V. 

X 

Thus there cannot be more than AOSN values of x for which \h * g(x)\ ^ 45 1 / 2 . □ 